Description

[0001] The present invention relates to a novel transcription factor of MHC class II genes and its derivatives, to
inhibitors of this transcription factor capable of down-regulating the expression of MHC class II molecules, and to
medical uses of these inhibitors.

[0002] MHC class II molecules, for example HLA-DR, HLA-DQ and HLA-DP in humans, are transmembrane
heterodimers that are essential for antigen presentation and activation of T lymphocytes. They are encoded by a
multigene family and are highly regulated in their expression.

[0003] Abnormal or aberrant expression of MHC class II genes leads to aberrant T cell activation, which leads
to an abnormal immune response.

[0004] Such an abnormal immune response is a cause of inflammatory events, autoimmune diseases or rejection of
transplanted organs.

[0005] There is an important need to downregulate the expression of MHC class II molecules in order to treat or 
prevent the above cited clinical events. 
[0006] A powerful means of obtaining MHC class II downregulation or immunosuppression consists in intervening
in the transcription (transcriptional intervention) of the MHC class II genes.

[0007] Transcriptional intervention can be achieved by acting on the transcription factors which are involved in the 
transcription of MHC class II genes. 

[0008] The transcription factor which is the target of the transcriptional intervention has to be essential and specific
for the expression of the genes that one wants to inhibit.

[0009] Such essential and specific transcription factors are however rare. 
[0010] Therefore, there is a great need to identify such a transcription factor.

[0011] The invention relates to such a new transcription factor, called RFX-ANK, which is a subunit of the
heterotrimeric transcription complex called RFX that binds to the conserved X box motif of all MHC II promoters.
[0012] Much of what we know today about this important transcription factor called RFXANK results from the
study of a rare genetic disease related to MHC class II deficiency.

[0013] MHC class II deficiency is an unusual autosomal recessive disease in which the genes implicated in the
phenotype (MHC-II genes) are in fact intact. Instead, the original defect results from mutations in several different
trans-acting regulatory genes responsible for the regulation and expression of MHC-II genes (refs.1,2).

[0014] In MHC-II deficiency, there is a complete loss of expression of all MHC-II genes, which leads to a form of
severe primary immunodeficiency (refs.1,2). Patients have repeated infections and frequently die before the age of 10.
All clinical manifestations and phenotypic features of these patients can be attributed to the defect in the expression
of MHC-II genes (refs.3,4). Because of the key role of the tight and complex regulation of MHC-II genes in the control
of the immune response, the recognition of MHC-II deficiency as a disease of gene regulation has led to a wide interest
in the search for the regulatory factors that are involved. These factors are important elements controlling the
development, homeostasis and effector functions of the immune system.

[0015] Although clinically homogeneous, the disease is genetically heterogeneous. There are four distinct genetic
complementation groups (A-D, refs.5-7), which reflects the existence of four essential regulators of MHC-II expression
(ref.2). The regulatory genes that are defective in complementation groups A (MHC2TA), C (RFX5) and D (RFXAP) have been
identified (refs.8-10). CIITA (MHC2TA gene product) is a non-DNA-binding co-activator, whose expression controls the
cell type specificity and inducibility of MHC-II genes (refs.8,11,12). RFX5 and RFXAP are two subunits of RFX, a
multi-protein complex that binds to the conserved X box motif of all MHC-II promoters (refs.13,14). Mutations in either
RFX5 or RFXAP abolish binding of RFX and transcription of MHC-II genes (refs.9,10,15).

[0016] The molecular defect responsible for complementation group B, although it accounts for the great majority
of MHC-II deficiency patients, has remained elusive. Like complementation groups C and D, group B is characterized
by a specific lack of binding of the RFX complex to the X box of MHC-II promoters (refs.2,13,14), suggesting a defect
in an additional subunit of RFX. However, neither complementation of defective cells, an approach that had led to the
cloning of MHC2TA (ref.8) and RFX5 (ref.9) genes, nor classical affinity purification of the RFX complex, which had
allowed the cloning of RFXAP (ref.10), has permitted us, or other laboratories, to identify the gene that is affected
in complementation group B.

[0017] The fact that the great majority of all MHC-II deficiency patients belong to group B had highlighted this group
as a challenge. This remained the case after the cloning of the three regulatory genes affected in groups A, C and D
(refs.8-10). We and others have attempted to solve group B by a variety of strategies. These have included functional
complementation assays similar to those that had led to the discovery of CIITA (ref.8) and RFX5 (ref.9), classical
multistep purification, as was used in the identification of RFXAP (ref.10), mutagenesis by retroviral insertion, two
hybrid selection systems in yeast and genetic localization by linkage studies. The approach that finally turned out to
be successful is an efficient single step DNA affinity purification procedure that exploits the remarkable stability of
a large multi-protein complex formed by RFX, X2BP, and NF-Y, which bind cooperatively to a longer segment of MHC-II
promoters (the X-X2-Y box region) (refs.18,19-23) (see Fig. 6). The very high stability of this large complex represents
an obvious advantage in terms of the yield and enrichment obtained in the purification procedure. Another major
advantage is that it allows the selection of those factors that are part of a physiologically relevant
multi-protein-DNA complex, at the expense of numerous other proteins capable of binding individually to the isolated
X, X2, and Y motifs. Thus, in addition to RFX purification, it allows the co-purification of the biologically relevant
X2 and Y box-binding factors.
[0018] Thus, here we report the isolation of a novel subunit of the RFX complex by an efficient single-step
DNA-affinity purification approach. This procedure takes advantage of a strong co-operative protein-protein interaction
that results in the highly stable binding of three distinct multi-protein transcription factors, RFX, X2BP (ref.16) and
NF-Y (ref.17), to the X-X2-Y region of MHC-II promoters (refs.10,18,19). The higher order RFX-X2BP-NF-Y complex formed
on DNA contains the physiologically relevant proteins involved in the activation of MHC-II promoters. With the use of
this new affinity purification procedure, the novel component of the RFX complex called RFXANK was identified and the
corresponding gene was isolated.

[0019] This gene is capable of fully correcting the MHC-II expression defect of cell lines from several patients in
complementation group B and it was indeed found to be mutated in these patients. The amino acid sequence revealed
a series of ankyrin repeats, a well defined protein-protein interaction motif (refs.20,21), and this novel factor was
therefore called RFXANK. We also demonstrate here that RFXANK is essential for binding of the RFX complex to the X box
of MHC-II promoters.

[0020] RFXANK is shown here to have a dramatic effect on the binding of the RFX complex to the X box motif.
RFX5, which possesses a characteristic DNA-binding domain (the RFX DBD motif, ref.24), cannot bind to its natural DNA
target either alone (ref.9) or in the presence of RFXAP (ref.10). The X box-binding activity is restored, however, when
RFX5 and RFXAP are co-translated together with RFXANK, demonstrating that the presence of each of the three subunits is
required (Fig. 5). The existence of ankyrin repeats within RFXANK suggests that this motif mediates interactions
between the different subunits of the RFX complex. Ankyrin repeats are protein-protein interaction motifs known to be
implicated in a wide range of biologically important regulatory events, such as the binding of the p53 binding protein
(53BP2) to its target (ref.30), of IκB to NF-κB (refs.31,32) and of INK4 kinase inhibitors to CDK4/6 (ref.33). However,
ankyrin repeats have been described only rarely in transcription factors. An example of particular interest is GABP, a
heterodimeric transcription factor composed of a DNA-binding α subunit and a β subunit containing ankyrin repeats
(refs.28,29). Interestingly, the binding of GABPβ to GABPα, which is mediated by the ankyrin region of GABPβ, greatly
enhances the affinity of GABPα for its DNA target (refs.34,35). By analogy, we propose that interactions between RFXANK
and RFX5, in the presence of RFXAP, induce structural alterations that allow the DNA binding domain of RFX5 to bind the
X box of MHC-II promoters. Alternatively, the ankyrin repeats of RFXANK may be involved in other functionally essential
interactions, such as the cooperative interactions that stabilize the higher order RFX-X2BP-NF-Y complex or direct
contacts between RFX and the co-activator CIITA.

[0021] The human RFXANK gene was identified in genomic sequences available in GenBank. This allowed us to
establish rapidly the entire intron-exon organization of the RFXANK gene (see Fig. 5). Furthermore, it allowed us to
confirm and extend our initial chromosomal mapping to 19p of the gene involved in group B of MHC-II deficiency. This
localization was obtained from linkage studies with a small number of informative families. The RFXANK gene is
located on human chromosome 19, at 19p12, between markers D19S566 and D19S435 (see GenBank accession
numbers AD000812 and AC003110). The availability of the RFXANK gene will make possible an analysis of potential
polymorphism. We suggest that the RFXANK gene, like the three other transactivator genes that control MHC-II
expression, should be examined closely for possible polymorphic diversity, either in coding or control regions. This
may be relevant to various forms of immunopathology.

[0022] RFX5 and RFXAP are expressed constitutively in all cell types tested and the corresponding genes are not
known to be regulated. It is likely that expression of the RFXANK gene exhibits the same ubiquitous pattern since the
corresponding ESTs (over 80 different clones) have been isolated from a wide variety of different tissues. The gene
encoding CIITA (MHC2TA), on the other hand, is extremely tightly regulated, with a constitutive expression that is
restricted to professional antigen presenting cells. It is the expression of CIITA, either constitutive in certain
specialized cells or inducible in others, that determines activation of MHC-II promoters (refs.11,12). Thus, two
distinct regulatory components, the ubiquitously expressed RFX complex and the highly regulated transactivator CIITA,
must act in concert.
[0023] Several features make MHC-II deficiency unique among diseases of gene regulation. First, patients from the
different genetic complementation groups have indistinguishable clinical features (refs.2-4). Thus, defects in four
distinct and unrelated genes (MHC2TA, RFX5, RFXAP and RFXANK) all result in a single disease having a remarkably
homogeneous clinical and biological phenotype. Second, the factors involved in this disease exhibit a very tight
control over MHC-II gene expression. This control cannot be compensated for by other redundant regulatory mechanisms.
In contrast, defects in many other regulatory genes such as Oct2, Obf1 or MyoD1 frequently reveal the existence of
alternative pathways of activation of the specific genes that are under their control. Third and most important, no
other genes, except for the MHC-II related HLA-DM and Ii genes, are known to be affected significantly in MHC-II
deficiency. All observed symptoms and biochemical features of the disease can be attributed to a single defect, namely
a general lack of both constitutive and inducible expression of MHC-II genes in all cell types (refs.2-4). This
indicates that all four trans-activating factors (CIITA, RFX5, RFXAP and RFXANK) are dedicated to the control of MHC-II
genes. This restriction in specificity is also uncommon.

[0024] Indeed, genetic defects in many transcription factors, in particular in those affected in human diseases,
result in pleiotropic effects almost always involving complex developmental disorders (for review see ref.36). This is
because the majority of regulatory genes affected in human diseases, in contrast to the four transactivators defective
in MHC-II deficiency, are implicated in the control of multiple different target promoters and genes. In the case of the
three subunits of RFX, the restricted specificity contrasts with their ubiquitous pattern of expression.
[0025] The discovery of RFXANK as the gene affected in complementation group B of MHC-II deficiency has
allowed us to complete the elucidation of the molecular and genetic basis of this disease. Two features make this
disease of gene regulation highly unusual. First, defects in any one of four entirely different transcriptional
transactivators lead to the very same clinical and phenotypic manifestations. Second, each of these four MHC-II
transactivators is not only essential for the control of MHC-II gene expression but is also dedicated to the control of
these genes, without playing any major role in the transcriptional control of other genes.

[0026] It is known from the phenotype of patients of complementation group B, and of cell lines from such patients,
that the RFX-ANK factor is indeed both essential for the activity of MHC class II genes and specific in its effect on
these genes.

[0027] Thus, the new transcription factor of the invention has two essential and unusual characteristics that make
it suitable as a target for transcriptional intervention.

[0028] The invention relates to a protein or a peptide capable of restoring the MHC II expression in cells from MHC
II deficiency patients in complementation group B and comprising all or part of the amino acid sequence shown in
figure 2.

[0029] The cells from MHC II deficiency patients in complementation group B may be chosen among BLS1 cell 
lines (reference 6), Na cell lines (reference 40) or Ab cell lines (reference 14). 

[0030] The MHC II molecules which are restored by a protein or a peptide of the invention can be chosen among
the HLA-DR, HLA-DP or HLA-DQ molecules.

[0031] The invention relates to a protein or a peptide comprising the amino acid sequence shown in figure 2 and 
part thereof, or an amino acid sequence having at least 80 % and preferably at least 90 % homology with the illustrated 
sequences and part thereof. The homologous sequences and parts thereof are referred to as "derivatives". 
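By way of illustration only, the 80 % and 90 % thresholds can be checked computationally once two sequences have been aligned. The following minimal sketch assumes that "homology" is measured as percent identity over an existing pairwise alignment; this measure, the function names and the toy sequences (which are not taken from figure 2) are assumptions introduced for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns that carry identical residues.

    Both inputs must already be aligned (equal length, gaps as '-').
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def classify_derivative(pct: float) -> str:
    """Map a percent identity onto the two thresholds quoted in the text."""
    if pct >= 90.0:
        return "preferred (>= 90 %)"
    if pct >= 80.0:
        return "within scope (>= 80 %)"
    return "below 80 %"


if __name__ == "__main__":
    # Toy 10-residue peptides differing at two positions.
    pct = percent_identity("MKTAYIAKQR", "MKTAYIAKAA")
    print(pct, classify_derivative(pct))
```

In practice the alignment itself would be produced by a dedicated alignment tool; the sketch only covers the scoring step.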
[0032] Part of a protein or a peptide may be fragments of at least 6 amino acids, preferably at least 20 amino acids.
[0033] The amino acid sequence shown in figure 2 is the human sequence of RFX-ANK. Derivatives of the figure
2 sequence may be agonists or antagonists of the RFX-ANK function as defined below.

[0034] In a further embodiment, the invention relates to a protein or a peptide comprising the amino acid sequence 
of RFX-ANK of other species than human and part thereof, especially pig and to the amino acid sequence having at 
least 80 % and preferably at least 90 % homology with these amino acid sequences and part thereof. 
[0035] The amino acid sequence of RFX-ANK of other species can be obtained by standard methods like cross 
hybridization at low stringency (see Maniatis). 

[0036] The invention also relates to an antibody capable of specifically recognising a protein or a peptide of the 
invention. 

[0037] Such an antibody can be obtained by standard methods. In vitro methods are for example ELISA assays. In
vivo methods consist for example in administering to an organism a protein or a peptide of the invention in order to
produce antibodies specific for the administered protein or peptide.

[0038] In vivo and in vitro methods may of course be used in a complementary way to obtain antibody. 

[0039] Antibodies of the invention may be monoclonal. Monoclonal antibodies can for example be produced by the
hybridoma technique.

[0040] Antibody of the invention may be single chain antibody. Techniques to generate single chain antibody are 
well known. 

[0041] The invention also relates to a nucleic acid molecule (RNA, DNA or cDNA) encoding a protein, a peptide or 
an antibody of the invention. 

[0042] The nucleic acid molecule of the invention may comprise all or part of the nucleotide sequence illustrated in 
figure 2 (GenBank Accession Number : Human RFXANK cDNA AF094760). 

[0043] A part of a nucleic acid molecule or of a nucleotide sequence corresponds to at least 18 nucleotides, and 
preferentially at least 60 nucleotides. 

[0044] The invention also relates to a nucleic acid molecule capable of hybridizing under stringent conditions with
the nucleic acid molecule of the invention.

[0045] Typical stringent conditions are those where the combination of temperature and salt concentration is chosen
to be approximately 12-20°C below the Tm (melting temperature) of the hybrid under study.
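As a rough illustration of how such a temperature window might be estimated, the sketch below computes an approximate Tm and the range lying 12-20°C below it. The Tm formula used (81.5 + 16.6·log10[Na+] + 0.41·%GC − 675/N) is one commonly quoted empirical approximation for DNA duplexes and is an assumption of this example, not a formula given in the present description; the default salt concentration and the function names are likewise illustrative.

```python
import math
from typing import Tuple


def estimate_tm_celsius(seq: str, na_molar: float = 0.05) -> float:
    """Rough Tm estimate for a DNA duplex (assumed empirical approximation).

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    """
    bases = seq.upper()
    n = len(bases)
    if n == 0:
        raise ValueError("sequence must not be empty")
    if na_molar <= 0:
        raise ValueError("salt concentration must be positive")
    gc_percent = 100.0 * (bases.count("G") + bases.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 675.0 / n


def stringent_window(tm_celsius: float) -> Tuple[float, float]:
    """Temperature range lying 12-20 degrees C below the given Tm."""
    return (tm_celsius - 20.0, tm_celsius - 12.0)


if __name__ == "__main__":
    probe = "ATGGAGCTTACCCAGCCTGCAGAAGACCTCATCCAGACCCAGCAGACCCCTGCCTCAGAA"  # arbitrary example
    tm = estimate_tm_celsius(probe, na_molar=0.3)
    low, high = stringent_window(tm)
    print(f"Tm ~ {tm:.1f} C, stringent window ~ {low:.1f} to {high:.1f} C")
```

The estimate ignores mismatches, formamide and RNA-DNA hybrids, so it would only serve as a starting point for experimental optimisation.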

[0046] In a further embodiment, the invention relates to a nucleic acid molecule comprising the nucleotide
sequence illustrated in figure 2, to a nucleotide sequence exhibiting at least 90 % identity with said nucleotide
sequence, or to a part of said nucleotide sequence.

[0047] In another embodiment, the invention relates to the nucleotide sequence of the RFX-ANK gene of other
species than human (especially pigs), to the nucleotide sequence having at least 60 % or 70 % identity with said
nucleotide sequence and to a part of said nucleotide sequences.

[0048] The invention relates particularly to the mouse RFXANK nucleotide sequence (GenBank Accession Number:
Mouse RFXANK cDNA AF094761, see figure 3), to the nucleotide sequence having at least 60 % or 70 % identity with
this nucleotide sequence and to a part of said nucleotide sequences.

[0049] In another embodiment, the invention relates to a nucleic acid molecule comprising a sequence
complementary to the nucleic acid molecule of the invention.

[0050] In a further embodiment the invention relates to an anti-sense molecule or a ribozyme comprising a nucleic
acid molecule as recited in the preceding paragraph.

[0051] The nucleic acid molecule of the invention may have at least 20 nucleotides and preferably at least 40. 

[0052] The invention also relates to a vector comprising a nucleic acid molecule of the invention or being able to
express an anti-sense molecule or a ribozyme of the invention.

[0053] A peptide, a protein or a nucleic acid molecule of the invention is called a transcription factor of the invention. 

[0054] After the transcriptional intervention target (transcription factor of the invention) has been identified, the
transcriptional intervention then requires the identification of inhibitors or inhibitory molecules (by biochemical in
vitro screening and/or by cell based screening) followed by tests of putative inhibitors in animal models.
[0055] "Inhibitory molecules" or "inhibitors" of a molecule are substances which have the capacity to inhibit
a function, an activity or the synthesis of a molecule of interest.

[0056] The molecule of interest may be a nucleic acid molecule or a peptide or a protein. 

[0057] The capacity to inhibit may be the partial or total capacity to block, repress, suppress, stop, abolish, compete 
with or downregulate, the function, or the synthesis of the molecule of interest. 
[0058] The molecule of interest of the present invention is a protein, a peptide or a nucleic acid molecule of the
invention encoding said protein or peptide. The molecule is called a transcription factor of the invention. The
transcription factors may be RFX-ANK as shown in figure 2 or derivatives thereof as described earlier.

[0059] The function of a transcription factor of the invention, particularly the transcription factor RFX-ANK and its
functional derivatives, can be defined as the capacity to enable the functional transcription of MHC class II genes, via
the RFX complex, and consequently the expression of MHC class II gene products.

[0060] This function can be recognised as the capacity to correct the MHC II expression defect of cell lines from 
patients in complementation group B. The function of the molecules of the invention is achieved globally by a series of 
sequential steps involved. Thus, each of these steps can be considered, in the context of the invention, as being the 
direct or indirect function of the molecules of interest of the invention. 

[0061] Consequently, the function or activity of the molecule of interest of the invention (transcription factor of the
invention) signifies the capacity to allow the expression of MHC class II molecules, to allow the transcription of an MHC
class II gene, to allow the expression or the translation of an MHC class II protein or peptide, to allow the formation of
the RFX complex, to allow the binding of the RFX complex to the X box motif, to allow the interaction between the RFX
complex and at least one of the transcription factors X2BP, NF-Y or CIITA, to allow a cooperative interaction that
stabilizes the higher order RFX-X2BP-NF-Y complex, to allow direct contacts between RFX and the co-activator CIITA, to
allow binding of RFX5 to the X box, or to correct the MHC II expression defect of cell lines from patients in
complementation group B.

[0062] The synthesis of the molecule of interest of the invention (transcription factor of the invention) may be the
transcription and/or translation of the nucleic acid molecule of the invention into the protein or peptide of the invention
encoded by said nucleic acid molecule.

[0063] Due to the fact that the transcription factor of the invention does not play any major role in the transcriptional 
control of other genes than MHC class II genes, inhibitors of said transcription factor of the invention are devoid of other 
undesirable inhibitory effects. 

[0064] Inhibitors of the invention have a very specific action limited to MHC class II genes because their target, i.e.
the transcription factor of the invention, has a specificity restricted to promoters of MHC class II genes.

[0065] In fact, a transcription factor of the invention does not appear to be implicated in the control of other target 
promoters and genes. 

[0066] Furthermore, inhibitors of the invention are very efficient in their action on MHC class II gene expression
because their target is essential for the control of MHC class II gene expression.
[0067] Thus, an important aspect of the invention relates to the identification of inhibitors of a transcription factor of
the invention.

[0068] Thus, the invention includes a process for identifying inhibitors of a protein, a peptide or a nucleic acid mol- 
ecule of the invention. 
[0069] Potential inhibitors tested may be of natural or synthetic origin and preferably of low molecular weight.
[0070] The candidates tested may be proteins, peptides, amino acids, nucleic acids, antibodies or other molecules
and may come from large collections of organic molecules which are readily available in the form of chemical libraries.
Chemical libraries include combinatorial libraries or "phage display" libraries of peptides or antibodies.
[0071] Small molecules may be tested in large amounts using combinatorial chemistry libraries.

[0072] In a first part of this important aspect, the invention concerns inhibitors which have the capacity to inhibit a 
function or an activity of a transcription factor of the invention. 

[0073] As indicated above, the function or activity of the molecule of interest of the invention (transcription factor of
the invention) may be to allow the expression of MHC class II molecules, to allow the transcription of an MHC class II
gene, to allow the expression or the translation of an MHC class II protein or peptide, to allow the formation of the RFX
complex, to allow the binding of the RFX complex to the X box motif, to allow the interaction between the RFX complex
and the transcription factor X2BP, NF-Y or CIITA, to allow a cooperative interaction that stabilizes the higher order
RFX-X2BP-NF-Y complex, to allow direct contacts between RFX and the co-activator CIITA, to allow binding of RFX5 to the
X box, or to correct the MHC II expression defect of cell lines from patients in complementation group B.

[0074] Said inhibitors may be antibodies of the invention and especially single chain antibodies, derivatives of a
protein or a peptide of the invention and especially dominant negative mutants, proteins or peptides or small molecular
weight molecules.

[0075] Derivatives of a protein or a peptide of the invention are any molecules which comprise part of a protein or 
a peptide of the invention. 

[0076] Said part of a protein or a peptide of the invention may comprise at least 6 amino acids, preferably at least
20 amino acids.

[0077] Said part may preferably comprise or consist of all or part of the ankyrin repeat-containing region. The ankyrin
repeat-containing region of humans is shown at the bottom in figure 3 (identified as ank1, ank2, ank3).
[0078] Dominant negative mutants of a protein or a peptide of the invention may be generated with known
techniques such as PCR mutagenesis or N- or C-terminal deletion libraries. Dominant negative mutants may be generated
as described in reference 51.

[0079] Antibodies of the invention are defined above.

[0080] Inhibitors of the invention may be identified or are identifiable by the following processes which are part of 
the invention. 

[0081] The invention relates to a process for identifying inhibitors which have the capacity to inhibit a function or an
activity of a transcription factor of the invention. The precise process steps may be chosen in view of the function or
activity of the transcription factor of the invention which is qualitatively or quantitatively detected.
[0082] In a first embodiment, the detected function is the expression of MHC class II molecules. Then the invention
relates to a process for identifying inhibitors which have the capacity to inhibit the expression of MHC class II
molecules at the surface of cells or inside the cells.

[0083] If the expression of MHC class II is measured at the surface of cells, the detected function is the translation 
or expression of MHC II proteins or peptides. 

[0084] Candidate inhibitors may then be tested for their capacity to inhibit said function in simple robust cell based
assays of the expression of MHC class II molecules at the surface of cells. Such assays, based on detection with
available monoclonal antibodies, are readily available. They involve both the detection of inhibition of constitutive
expression of MHC class II on B lymphocyte cell lines and/or the detection of inhibition of induction of MHC class II
expression by interferon gamma on any one of many inducible cell lines, such as HeLa cells. These cell-based secondary
screening assays can be performed on a large scale and can thus accommodate very large collections of candidate compounds.
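Purely as an illustration of how readouts from such a cell-based screen might be processed, the sketch below converts a surface-expression signal (for example a mean fluorescence value obtained with an anti-MHC class II antibody) into a percent inhibition and lists the candidates above a cut-off. The normalisation scheme, the 50 % threshold, the compound names and the numerical values are all assumptions made for the example and are not taken from the present description.

```python
from typing import Dict, List


def percent_inhibition(treated: float, untreated: float, background: float = 0.0) -> float:
    """Percent reduction of the MHC class II signal relative to untreated cells.

    `background` is the signal of a negative control (e.g. unstained cells).
    """
    window = untreated - background
    if window <= 0:
        raise ValueError("untreated signal must exceed background")
    return 100.0 * (1.0 - (treated - background) / window)


def call_hits(readings: Dict[str, float], untreated: float,
              background: float = 0.0, threshold: float = 50.0) -> List[str]:
    """Return, in sorted order, the candidates whose inhibition reaches the threshold."""
    return sorted(
        name for name, signal in readings.items()
        if percent_inhibition(signal, untreated, background) >= threshold
    )


if __name__ == "__main__":
    # Hypothetical plate readings (arbitrary fluorescence units).
    plate = {"cpd_A": 550.0, "cpd_B": 950.0, "cpd_C": 100.0}
    print(call_hits(plate, untreated=1000.0, background=100.0))
```

The same scoring could be applied either to constitutively expressing B cell lines or to interferon gamma-induced cells, with the untreated control chosen accordingly.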
[0085] If the expression of MHC class II is measured at the mRNA level, the detected function is the transcription
of MHC class II genes.

[0086] In this case, candidate inhibitors may be tested for their capacity to inhibit said function by quantification of
MHC II mRNA levels, especially by RNase protection analysis.

[0087] Different MHC II isotypes may be searched in said quantification procedure. 

[0088] In a second embodiment, the detected function is the formation of the RFX complex. Then the invention relates
to a process for identifying inhibitors which have the capacity to inhibit the formation of the RFX complex or the
assembly of its three subunits (RFX5, RFXAP, RFXANK).

[0089] The three subunits may be mixed with the potential inhibitor in conditions which allow the formation of the
RFX complex in the absence of any efficient inhibitors.

[0090] Then, the presence or absence of the RFX complex can be detected. Detection of the presence or absence
of the RFX complex may be done with antibodies specific for the complete RFX complex or for the RFX complex as
constituted by the 3 subunits assembled. The function of the second embodiment may also be searched or measured with
indirect methods such as the ones used to search or measure the function in the first embodiment.

[0091] In a third embodiment, the detected function is the binding of the RFX complex to its DNA target (especially
the X box motif of MHC II promoters). Then the invention relates to a process for identifying inhibitors which have the
capacity to inhibit the binding of the RFX complex to its DNA target.

[0092] Said process may be a straightforward assay to measure the binding of the RFX complex (composed of
RFX-ANK, RFX5 and RFXAP) to its DNA target. It may be done for example by gel retardation assays.
[0093] Said process may be performed on a large scale and will detect compounds capable of inhibiting the binding
of RFX to DNA. Such an assay can be set up on a very large scale, for the screening of a large quantity of candidate
molecules.

[0094] An embodiment of this process may comprise the following steps:

- the DNA fragment corresponding to the X box of the MHC II promoters is mixed with a nuclear extract of a cell and
  with the substance to be tested;
- the mixture is put on a gel for running;
- if the substance does not inhibit the formation of the RFX complex, then the RFX complex binds to the DNA and
  the DNA-protein association migrates more slowly than the unbound DNA fragment.
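As a hedged illustration of how the outcome of such a gel retardation assay might be scored, the sketch below computes the fraction of labelled DNA found in the retarded (protein-bound) band and compares a test lane with a control lane run without the substance. The use of band intensities, the 50 % relative drop used as a criterion and all numerical values are assumptions introduced for this example only.

```python
from typing import Tuple

Lane = Tuple[float, float]  # (retarded band intensity, free DNA band intensity)


def fraction_shifted(bound_intensity: float, free_intensity: float) -> float:
    """Fraction of the labelled DNA that migrates in the retarded band."""
    total = bound_intensity + free_intensity
    if total <= 0:
        raise ValueError("lane contains no signal")
    return bound_intensity / total


def binding_inhibited(control: Lane, test: Lane, min_relative_drop: float = 0.5) -> bool:
    """True if the shifted fraction in the test lane drops by at least
    `min_relative_drop` relative to the control lane (no substance added)."""
    control_fraction = fraction_shifted(*control)
    if control_fraction == 0:
        raise ValueError("no binding in control lane; assay not interpretable")
    test_fraction = fraction_shifted(*test)
    return (control_fraction - test_fraction) / control_fraction >= min_relative_drop


if __name__ == "__main__":
    # Hypothetical densitometry values (arbitrary units).
    print(binding_inhibited(control=(60.0, 40.0), test=(15.0, 85.0)))
```

Requiring a visible retarded band in the control lane guards against interpreting a failed binding reaction as inhibition.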


[0095] In a fourth embodiment, the detected function is the interaction between the RFX complex and at least one of
the transcription factors X2BP, NF-Y and CIITA.

[0096] The invention relates to a process for identifying inhibitors which have the capacity to inhibit the interaction
between the RFX complex and at least one of the transcription factors X2BP, NF-Y and CIITA.
[0097] Said process may comprise the mixing of the candidate inhibitor with the RFX complex and at least one of
the transcription factors X2BP, NF-Y and CIITA.

[0098] Then, the presence or absence of the interaction could be measured and therefore the inhibitors identified.
[0099] In a fifth embodiment, the detected function is the correction of the MHC II expression defect of cell lines
from complementation group B.
[0100] Thus, the invention relates to a process for identifying inhibitors which have the capacity to inhibit the
correction by a transcription factor of the invention of the MHC class II expression defect of cell lines from
complementation group B.

[0101] Such a process may comprise the following steps:

- cotransfection of cells of patients from complementation group B with the potential inhibitor and a transcription
  factor of the invention, especially RFXANK cDNA;
- analysis of expression of HLA class II molecules.

[0102] Analysis of expression of HLA class II molecules may be done at the surface of transfectant cells (detection
of peptide or protein expression) or inside the transfectant cells (detection of mRNA expression).

[0103] In a second part of this important aspect of the invention, the invention concerns inhibitors which have the 
capacity to inhibit the synthesis of a transcription factor of the invention. 

[0104] The synthesis of the molecule of interest of the invention (transcription factor of the invention) may be the
translation of the nucleic acid molecule of the invention into the protein or peptide of the invention encoded by said
nucleic acid molecule.

[0105] Said inhibitors may inhibit any product which contributes to the synthesis of a transcription factor of the 
invention. 

[0106] A product which contributes to the synthesis of a transcription factor of the invention may be a nucleic acid
molecule of the invention, either a DNA or an RNA molecule, or a regulatory sequence of said nucleic acid molecule.

[0107] Said product may particularly be a DNA molecule coding for RFXANK as shown in figure 2, any DNA
sequence with at least 80 % identity, preferably 90 % identity, with said DNA molecule, or any part of said DNA molecule
or said DNA sequence.

[0108] Said product may be an RNA molecule corresponding to the DNA molecule coding for RFXANK as shown
in figure 2, any RNA molecule with a nucleotide sequence having at least 80 % identity, preferably 90 % identity, with
said RNA molecule, or any part of said RNA molecules.
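
The identity thresholds of paragraphs [0107] and [0108] can be illustrated with a short calculation. The sketch below is only one possible reading: the text does not say how identity is computed, so counting identical positions over all aligned columns (gap columns included) is an assumption, and the function names and toy sequences are hypothetical rather than taken from figure 2.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns carrying the same residue.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Assumption: a residue facing a gap counts as a mismatch, and
    columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # uninformative column
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if the two aligned sequences reach the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy sequences, one mismatch in ten positions.
    print(percent_identity("ATGCATGCAT", "ATGCATGCAA"))
    print(meets_identity_threshold("ATGCATGCAT", "ATGCATGCAA", 80.0))
```

With the default threshold the helper corresponds to the 80 % limit; passing `threshold=90.0` corresponds to the preferred limit.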

[0109] Said inhibitors may be a ribozyme, a DNA or an RNA antisense.

[0110] Said inhibitors may be a ribozyme, a DNA or an RNA antisense of the invention as recited above.

[0111] Complementary nucleotide sequences of the nucleic acid molecule of the invention, also referred to
as « antisense » RNA or DNA, are known to be capable of inhibiting the synthesis of the protein encoded by the
relevant nucleic acid molecule of the invention.

[0112] The person skilled in the art and provided with the nucleic acid molecule sequences of the invention
mentioned above will be in a position to produce and utilize the corresponding « antisense » RNA and DNA and to use
them for the inhibition of synthesis of the transcription factor of the invention.

[0113] Inhibition of the synthesis of a transcription factor of the invention will result in the inhibition of the expression
of MHC class II genes. Indeed, it is known from the study of mutant cells that a deficiency in protein or peptide of the
invention is accompanied by a lack of expression of MHC class II genes.

[0114] The use of « antisense » RNA and DNA molecules, as described above, as inhibitors of MHC class II gene
expression will be important in medical conditions where a reduction of MHC class II molecules is desirable, as in the 
case of autoimmune diseases. 

[0115] The invention also relates to fragments of said nucleotide sequences, preferably to fragments of their coding
regions, including fragments of complementary or « antisense » RNA and DNA. The person skilled in the art and
provided with the sequences described above is in a position to produce the corresponding short « antisense »
oligonucleotides and to use them to inhibit the synthesis of a transcription factor of the invention and therefore
the expression of MHC class II genes.
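
How an antisense sequence follows from a given coding sequence can be shown in a few lines. This is a minimal sketch under stated assumptions: the input is a DNA coding-strand string, the antisense molecule is taken to be its reverse complement, and the function names and the six-nucleotide toy input are hypothetical (it is not the RFXANK sequence of figure 2). Choosing which window actually works as an inhibitor would need experimental testing that the sketch does not model.

```python
from typing import Iterator, Tuple

# Watson-Crick pairing for a DNA coding strand.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(sense: str) -> str:
    """Reverse complement of a DNA coding-strand sequence (5'->3')."""
    sense = sense.upper()
    unexpected = set(sense) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return sense.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(sense: str) -> str:
    """Antisense RNA for the same coding strand (T replaced by U)."""
    return antisense_dna(sense).replace("T", "U")


def antisense_windows(sense: str, length: int) -> Iterator[Tuple[int, str]]:
    """Yield (start, oligo) for every antisense oligonucleotide of the
    given length, where start is the 0-based position in the sense strand."""
    if length <= 0:
        raise ValueError("length must be positive")
    for start in range(len(sense) - length + 1):
        yield start, antisense_dna(sense[start:start + length])


if __name__ == "__main__":
    toy = "ATGGCC"  # hypothetical fragment, for illustration only
    print(antisense_dna(toy))
    print(antisense_rna(toy))
    print(list(antisense_windows(toy, 4)))
```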

[0116] Said inhibitors may be produced by reference to the nucleic acid molecule of the invention and may be
identified or are identifiable by a screening procedure for selecting candidate inhibitors capable of inhibiting the expression
of MHC II molecules.

[0117] The screening procedure may be a test in simple, robust cell-based assays of the expression of MHC class
II molecules at the surface of cells. Such assays, based on detection with available monoclonal antibodies, are readily
available. They will involve a search for inhibition of constitutive expression of MHC class II on B lymphocyte cell
lines and/or a search for inhibition of induction of MHC class II expression by interferon gamma on any one of many
inducible cell lines, such as HeLa cells. These cell-based secondary screening assays can be performed on a large scale
and can thus accommodate very large collections of candidate compounds.

[0118] The screening procedure for selecting substances capable of inhibiting the expression of MHC II molecules
may be based on the detection of the capacity to inhibit the binding of the RFX complex to its DNA target.

[0119] Said process may be a straightforward assay to measure the binding of the RFX complex (composed of 
RFX-ANK, RFX5 and RFXAP) to its DNA target, by gel retardation assays for example. 

[0120] Said assay may be performed on a large scale and will detect compounds capable of inhibiting the binding 
of RFX to DNA. Such an assay can be set up on a very large scale, for the primary screening of a large quantity of can- 
didate molecules. 

[0121] This assay can be summarized in the following steps : 

- the DNA fragment corresponding to the X box of the MHC II promoters is mixed with a nuclear extract of a cell and
with the substance to be tested;
- the mixture is put on a gel for running;
- if the substance does not inhibit the formation of the RFX complex, then the RFX complex binds to the DNA and
the DNA-protein association migrates slower than the non-DNA bound RFX complex.

[0122] The five embodiments of the process for identifying inhibitors which have the capacity to inhibit a function or 
an activity of a transcription factor of the invention may be used to identify inhibitors which have the capacity to inhibit 
the synthesis of a transcription factor of the invention. 

[0123] Another process for identifying inhibitors of the invention is the designing of inhibitors on the basis of the
three dimensional structure of the protein or peptide of the invention, information that can be obtained from
recombinant protein or peptide of the invention using state of the art technology, for example X-ray structure analysis,
spectroscopic methods, etc.

[0124] The process for identifying inhibitors of the invention may, in addition to the functional assays described
above, also include a preliminary or primary screening for testing a large number of candidates.

[0125] Said primary screening may be another well established procedure used on a large scale which consists in
screening for the binding of molecules to a peptide or a protein of the invention (or RFX5 or RFXAP).

[0126] The peptide or protein of the invention will preferably be produced by recombinant techniques in order to
obtain recombinant peptide or protein of the invention.

[0127] Such binding assays are performed on very large scales and on a routine basis by companies such as 
Scriptgen (Waltham, Mass) or Novalon (Durham, NC). Assays to detect such binding involve either ligand-induced 
change in protein conformation (Scriptgen) or ligand-induced displacement of molecules first identified as binding to the 
protein or peptide of the invention (Novalon). 

[0128] In a preferred process for identifying inhibitors, a protein or a peptide of the invention may be used for the 
identification of low molecular weight inhibitor molecules as drug candidates. 

[0129] Inhibitors of a protein or a peptide of the invention and of the expression of MHC class II genes, as potential 
drug candidates are preferably identified by a two step process : 

- in the first step, compatible with large scale, high throughput screening of collections (« libraries ») of small
molecular weight molecules, a protein or a peptide of the invention is used in a screening assay for molecules capable
of simply binding to the protein or the peptide of the invention (= « ligands »). Such high throughput screening
assays are routinely performed by companies such as Novalon Inc. or Scriptgen Inc., and are based either on
competition for binding of peptides to the target protein or on changes in protein conformation induced by binding of a
ligand to the target protein. Such primary high throughput screening for high affinity ligands capable of binding to a
target recombinant protein is available commercially (under contract) from such companies as Novalon or Scriptgen.

- in the second step, any low molecular weight molecule identified as described above as capable of binding to a
protein or a peptide of the invention is tested in the functional RFX complex assay or in the functional assay of MHC II
expression at the surface of cells.

[0130] All candidate molecules are thus tested, at different concentrations, for a quantitative assessment of their 
anti-protein or anti-peptide of the invention inhibitory efficacy. 
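
A quantitative assessment over several concentrations is commonly summarised as the concentration giving half-maximal inhibition. The sketch below illustrates one simple way to estimate such a value, by linear interpolation on a log10 concentration axis between the two measurements that bracket 50 % residual activity. This is an assumed analysis method, not one specified in the text, and the function name and the example readings are hypothetical.

```python
import math
from typing import Optional, Sequence


def ic50_by_interpolation(concentrations: Sequence[float],
                          percent_activity: Sequence[float]) -> Optional[float]:
    """Estimate the concentration at which residual activity reaches 50 %.

    concentrations   : molar concentrations, all strictly positive
    percent_activity : residual activity (100 = no inhibition) measured
                       at each concentration
    Returns None when the data never cross 50 %.
    """
    if len(concentrations) != len(percent_activity):
        raise ValueError("inputs must have the same length")
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    points = sorted(zip(concentrations, percent_activity))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(points, points[1:]):
        # Look for the pair of neighbouring points that brackets 50 %.
        if a_lo >= 50.0 >= a_hi and a_lo != a_hi:
            fraction = (a_lo - 50.0) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


if __name__ == "__main__":
    # Hypothetical readings for one candidate molecule.
    print(ic50_by_interpolation([1e-7, 1e-5], [75.0, 25.0]))
```

Candidates can then be ranked by this value, the lowest estimate corresponding to inhibition at the smallest molar concentration.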

[0131] Substances exhibiting anti-protein or anti-peptide or anti-nucleic acid molecule of the invention inhibitory
effects are then tested for obvious toxicity and in pharmacokinetics assays, in order to determine whether they represent
valuable drug candidates.

[0132] Once a substance or a composition of substances has been identified which is capable of inhibiting a
protein, a peptide or a nucleic acid molecule of the invention, its mode of action may be identified, particularly its capacity
to block transcription or translation of a protein or a peptide of the invention.

[0133] This capacity can be tested by carrying out a process comprising the following steps:

i) contacting the substance under test with cells expressing the protein or the peptide of the invention, as previously
defined, and

ii) detecting loss of expression of a protein or a peptide of the invention using markers for the protein or the peptide
of the invention, such as specific, labelled anti-protein or anti-peptide of the invention antibodies.

[0134] The antibodies used in such a detection process are of the type described earlier. 

[0135] The invention also relates to a kit for screening substances capable of blocking a protein or a peptide of the
invention activity, or of blocking transcription or translation of the protein or the peptide of the invention.

[0136] Inhibitors of the invention, and especially inhibitors identifiable by the process for identifying inhibitors recited
above and compatible with cell viability, may be identified chemically and on the basis of their structure. New collections
of related molecules may be generated (« analogue library ») and tested again with any one of the processes for
identifying inhibitors as described above. Candidate molecules capable of inhibiting any function of a transcription factor of
the invention at the smallest molar concentration may be selected and considered as candidate immunosuppressive
agents. They may first be tested for their effect on cells of various animal species in culture and then in in vivo studies
in appropriate animal models. Such studies may test an effect on MHC class II expression, on activation of T
lymphocytes in various well established experimental protocols, as well as in various experimental models of organ
transplantation.

[0137] An example of inhibitors of the invention may be natural mutants of RFX-ANK as present in MHC II defi- 
ciency patients of complementation group B. 

[0138] Said mutants may be mutants with the DNA or amino acid sequence as shown in figure 5 or recited in the
legend of figure 5, or derivatives thereof. Such mutants may be splice variants.

[0139] Said mutants may be used in a DNA or RNA form. Furthermore, said DNA or RNA form could be present in 
a vector and especially under the control of a strong promoter, like the CMV promoter, in order to overexpress the 
mutants and to overcome their recessive nature.

[0140] The inhibitors of the invention are used in prophylactic and therapeutic treatment of diseases associated 
with aberrant or abnormal expression of MHC class II genes. 

[0141] The inhibitors of the invention are useful as immunosuppressive agents, T-cell inactivation agents,
anti-inflammatory agents, immunomodulators, and agents reducing or downregulating the level of MHC II expression in a
reversible or irreversible manner.

[0142] The inhibitors of the invention are useful as a pretreatment of the recipient before transplantation especially 
in bone-marrow transplantation to avoid or reduce the risk of rejection of the transplanted organs. The inhibitors of the 
invention may be administered after transplantation as long as necessary to avoid the risk of rejection.

[0143] The invention concerns methods for treating or preventing autoimmune disease, by administering effective
amounts of substances capable of downregulating the expression of HLA class II genes. Among others, these include
autoimmune diseases where an aberrant and excessive expression of HLA class II molecules at the surface of certain
cells is thought to be responsible for the autoimmune pathological processes. These include Insulin Dependent
Diabetes (IDD), Multiple Sclerosis (MS), Lupus Erythematosus (LE) and Rheumatoid Arthritis (RA).

[0144] The invention also relates to pharmaceutical compositions comprising an inhibitor of the invention in
association with physiologically acceptable carriers, and to methods for the preparation of medicaments for use in therapy or
prevention of diseases associated with aberrant expression of MHC class II genes using these substances.

[0145] The proteic or peptidic inhibitors of the invention can be prepared in a DNA or in an RNA form, alone or with
one of the known DNA or RNA formulations (liposomes, antibodies, viral vectors, ...).

[0146] A further object of the present invention is to use a protein or a peptide or a nucleic acid molecule of the 
invention to generate MHC class II negative transgenic animals or transgenic animals with reduced level of expression 
of MHC class II. 

[0147] A preferred animal which can be generated as an MHC class II negative transgenic animal or as a
transgenic animal with reduced level of expression of MHC class II is the pig.

[0148] The generation of transgenic animals of the invention may involve the introduction of any one of the inhibitors
of the invention as a transgene. 

[0149] An alternative approach consists in disrupting or replacing the RFX-ANK gene of the transgenic animal by
any available technique so that this essential transcription factor of the MHC class II promoter is not transcribed.

[0150] The RFX-ANK gene of the transgenic animal of interest can be identified by simple comparison with the 
human RFX-ANK gene. The mouse RFX-ANK gene shown in figure 3 has been identified by such comparison (see 
example 2). 

[0151] Such an approach can be done by homologous recombination. The « knockout » technique may
particularly be used.

[0152] The animal of the invention may then be used as a source of organs for xenogenic transplantation or as a 
source of cells for universal cell transplants. 

[0153] The findings reported here have two other medically relevant implications. All patients with MHC-II
deficiency can now be readily classified into their respective genetic complementation groups by a direct assay involving
correction with each of the four regulatory genes. An assay based on the delivery of the four MHC-II regulatory genes
to PBLs via an efficient retroviral vector system may be used. This assay should allow a rapid and specific genetic
diagnosis that avoids the need to establish stable cell lines and to perform tedious complementation assays by cell fusion.
The second implication is that the large majority of MHC-II deficiency patients can now be considered as candidates for
gene therapy, as a possible alternative to bone marrow transplantation, which has a poor success rate in this disease (ref. 38).
Since the three protein subunits of the RFX complex are expressed constitutively in all cell types, these three genes
represent a favorable situation for genetic correction. Gene therapy is particularly relevant in the case of RFXANK,
which is affected in the large majority of patients with MHC-II deficiency. Transfer of the relevant wild-type genes into
either multipotent precursor cells or PBLs of MHC II expression deficiency patients may be performed.

[0154] Another object of the invention is a one-step high purification process by DNA affinity of substances,
comprising the following steps:

- identifying a DNA with more than one binding site;
- adding to this DNA an extract containing the substance to be purified;
- washing extensively (by increasing salts, or periods of time, or by adding competitor DNA for example);
- isolating the complex by its DNA target;
- releasing the substance from its complex.

[0155] In an example, this process made it possible to obtain in a single step a yield greater than 80 % and an
enrichment with respect to the acid nuclear extract of at least 3000 fold for the substance to be purified.

[0156] This new purification process is considerably better than known affinity purification procedures based on the
binding of the desired substance to its DNA target. This purification procedure is better in terms of yield, enrichment,
purity and simplicity.

[0157] The purification process of the invention applies to all molecules that are involved in stabilizing a DNA
binding multimolecule complex and especially that are involved in stabilizing protein-protein interaction.

[0158] Thus, an efficient single-step DNA-affinity purification procedure has been established that exploits the
remarkable stability of a large multi-protein complex formed by RFX, X2BP and NF-Y, which bind cooperatively to a
longer segment of MHC-II promoters (the X-X2-Y box region) (refs 18, 19, 23) (see Fig. 6). The very high stability of this
large complex represents an obvious advantage in terms of the yield and enrichment obtained in the purification procedure.
Another major advantage is that it allows the selection of those factors that are part of a physiologically relevant
multi-protein–DNA complex, at the expense of numerous other proteins capable of binding individually to the isolated X, X2,
and Y motifs. Thus, in addition to RFX purification, it allows the co-purification of the biologically relevant X2 and Y
box-binding factors.

[0159] A further object of the invention is a process of isolation of an unknown molecule which is involved in
stabilizing a DNA binding multimolecule complex and especially which is involved in stabilizing protein-protein interaction.

[0160] Thus, here we report the isolation of a novel subunit of the RFX complex by an efficient single-step
DNA-affinity purification approach. This procedure takes advantage of a strong co-operative protein-protein interaction that
results in the highly stable binding of three distinct multi-protein transcription factors, namely RFX, X2BP (ref. 16) and
NF-Y (ref. 17), to the X-X2-Y region of MHC-II promoters (refs 10, 18, 19). The higher order RFX–X2BP–NF-Y complex formed on
DNA contains the physiologically relevant proteins involved in the activation of MHC-II promoters. With the use of this
new affinity purification procedure, the novel component of the RFX complex called RFXANK was identified and the
corresponding gene was isolated.

[0161] By analogy, all that has been described for RFXANK is described for RFXAP, provided that the amino acid or
nucleotide sequence of RFXAP (reference 10) is substituted for the amino acid or nucleotide sequence of RFXANK and
provided that complementation group D is substituted for complementation group B.

[0162] Thus, for example, the protein or peptide of the invention in the case of RFXAP is capable of restoring the 
MHC II expression in cells from MHC II deficiency patients in complementation group D and comprises all or part of the 
amino acid sequence shown for RFXAP in reference number 10. 
[0163] This reasoning by analogy applies also to RFX5 (complementation group C), whose amino acid sequence
and nucleotide sequence can be found in reference 9.

[0164] Generally, said reasoning by analogy applies to each subunit of the RFX complex.

Figure legends

[0165]

Figure 1 : Purification of MHC-II promoter-binding proteins and identification of a novel 33 kDa subunit of the RFX
complex. Affinity purified DRA promoter binding proteins (lane 1) were immunoprecipitated with pre-immune serum
(lane 2) or anti-RFXAP antibody (lane 3). Proteins were fractionated by electrophoresis in 10% SDS-polyacrylamide
gels and visualised by silver-staining. The identity of the three RFX subunits (boxed: RFX5, RFXAP, and the
new p33 subunit) and of other proteins (NF-Y, PARP) was determined as described in the text. Bands derived from
the antiserum (as) are indicated.

Figure 2 : Sequence analyses of RFXANK.

Nucleotide and amino acid sequences of RFXANK. The first nucleotides of the 10 exons that are spliced
together into the RFXANK cDNA are indicated by arrows. The exon limits were established by comparing the cDNA
sequence with the genomic sequence containing the entire RFXANK gene. The 'cag' trinucleotide resulting from
alternative splicing of exon 4 is in lowercase and bold type. The in-frame TGA stop codon preceding the RFXANK
open reading frame is underlined. Tryptic peptides identified by microsequencing are underlined.

Figure 3 : Sequence comparisons

Amino acid sequence alignment between RFXANK and homologous proteins containing ankyrin repeats. The
human RFXANK sequence is shown at the top (Hs RFXANK). Identical amino acids in mouse RFXANK (Mm RFXANK)
and the other proteins are shown as dashes. Gaps are represented by points. 'Hs homol' and 'Mm homol'
correspond to the predicted translation products of cDNAs encoding a highly homologous protein present in humans
and mice, respectively. The ankyrin repeat-containing region of mouse GABPb (ref. 28) is shown at the bottom. The
secondary structure prediction of the ankyrin repeats (ank 1-3) was inferred from the known structure of GABPb
(ref. 35). H, helix; T, turn.

Figure 4 : RFXANK restores MHC II expression in cells from patients from complementation group B.

a, Cell lines from complementation groups A (RJ2.2.5), B (BLS1), C (SJO), and D (DA) were transfected with
the empty pEBO-76PL expression vector (left) or with pEBO-RFXANK (right). Transfected cells were stained
for HLA-DR expression and analyzed by FACScan.

b, pEBO-RFXANK restores expression of all three MHC-II isotypes in BLS1 cells. BLS1 cells transfected with
pEBO-76PL (upper panels) or with pEBO-RFXANK (middle panels) were analyzed by FACScan for expression
of HLA-DR, -DP, and -DQ. Control Raji cells were analyzed in parallel (bottom panels).

Figure 5 : Mutations within the RFXANK gene in three patients in MHC-II deficiency group B.

a, cDNA clones isolated from cells of patients BLS1, Ab, and Na lack exons 5 and 6. A schematic map of the
RFXANK gene is shown at the top: non-coding exons, open bars; coding exons, filled bars. The RFXANK
mRNAs isolated by RT-PCR from normal cells (wt) and from patient cells (BLS1, Ab, Na) are represented
below the RFXANK gene; untranslated regions (lines), protein-coding regions (boxes), and the position of
exons 4 to 7 are indicated. As a consequence of the aberrant splicing in patient cells, exons 4 and 7 of the
RFXANK mRNA are joined together. This leads to a translational frame shift (amino acids in lower case) followed
by a premature termination at a TGA stop codon at nucleotide 955.

b, Short deletions affecting exon 6 of the RFXANK gene are responsible for the aberrant splicing pattern. The
entire region spanning exons 4 to 7 of the RFXANK gene from patients BLS1, Na and Ab was amplified by PCR
and sequenced. The sequences of splice junctions flanking exon 6 are shown: intron sequences are in lowercase
and exon sequences are in uppercase. The 26 bp deletion found in patients Ab and Na (top), and the 58
bp deletion found in patient BLS1 (bottom) are boxed. The genomic region spanning exon 6 was amplified by
PCR from the patients and their parents and siblings. The PCR products obtained are shown below the
corresponding pedigrees. Abbreviated names of individuals are indicated; squares, males; circles, females; filled
symbols, patients that developed MHC II deficiency. Patients are homozygous for the mutated alleles (-/-),
parents are heterozygous (+/-), and healthy siblings are either heterozygous (+/-), or homozygous for the wild type
allele (+/+). For patient Na, the family was not available and the B cell line Raji (Ra) was used as homozygous
wild type control.
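
The genotype calls in panel b follow directly from the sizes of the PCR products: a deleted allele gives a product shorter than the wild-type product by the length of the deletion. The sketch below illustrates that reasoning only. The wild-type amplicon length (300 bp) and the sizing tolerance are hypothetical values chosen for the example, since the legend does not give them; the 26 bp and 58 bp deletions are the ones named above.

```python
from typing import Iterable


def call_genotype(band_sizes_bp: Iterable[float],
                  wild_type_bp: float,
                  deletion_bp: float,
                  tolerance_bp: float = 5.0) -> str:
    """Classify a sample from the PCR bands seen on a gel.

    Returns '+/+' (wild type only), '+/-' (both products),
    '-/-' (deleted product only) or 'undetermined'.
    The tolerance must stay below half the deletion size so that
    the two expected products cannot be confused.
    """
    if tolerance_bp * 2 >= deletion_bp:
        raise ValueError("tolerance too large to separate the two products")
    bands = list(band_sizes_bp)
    mutant_bp = wild_type_bp - deletion_bp
    has_wild_type = any(abs(b - wild_type_bp) <= tolerance_bp for b in bands)
    has_mutant = any(abs(b - mutant_bp) <= tolerance_bp for b in bands)
    if has_wild_type and has_mutant:
        return "+/-"
    if has_wild_type:
        return "+/+"
    if has_mutant:
        return "-/-"
    return "undetermined"


if __name__ == "__main__":
    WILD_TYPE_BP = 300  # hypothetical amplicon length
    print(call_genotype([274], WILD_TYPE_BP, 26))       # patient-like pattern
    print(call_genotype([300, 274], WILD_TYPE_BP, 26))  # parent-like pattern
    print(call_genotype([300], WILD_TYPE_BP, 58))       # wild-type control
```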

Figure 6 : A functional RFX complex can be reconstituted with in vitro translated subunits. 

a, All three subunits are required for binding of the RFX complex to DNA. RFX5, RFXAP, and RFXANK were
translated in vitro alone or in all possible combinations in the presence of [35S]-Met. Translation products were
analysed by SDS-PAGE and by EMSA for their ability to bind to a [32P]-labeled X box probe. For the EMSA
experiments, a crude B cell nuclear extract (NE) was used as a positive control.

b, The RFX complex reconstituted in vitro contains all three subunits. RFX5 and RFXAP were co-translated in
vitro together with RFXANK (upper panel) or a tagged version (FLAG-RFXANK) containing the FLAG epitope
at its N-terminus (lower panel). The translation reactions were used for EMSA. Where indicated, binding
reactions were supplemented with pre-immune serum or antibodies specific for RFX5, RFXAP or the FLAG
epitope.

Figure 7 : Elucidation of the molecular defects in MHC-II deficiency has led to the identification of four essential
and specific transactivators of MHC-II genes. A prototypical MHC-II promoter is depicted with conserved X, X2 and
Y motifs. Each of the four transactivators affected in MHC-II deficiency is highlighted. The respective
complementation groups, the number of patients reported, and the chromosomal location of each of the corresponding genes
are indicated. CIITA is a highly regulated non-DNA binding co-activator that is responsible for cell-type specificity
and inducibility of MHC-II expression. RFX5, RFXAP and RFXANK are the three subunits of the X box-binding RFX
complex. All four genes affected in the disease are essential and specific for MHC-II gene expression. The other
promoter-binding complexes, X2BP and NF-Y, are not specific for MHC-II expression and are not affected in
MHC-II deficiency.

Examples 

Example 1 : purification of the RFX complex and identification of a novel 33 kDa subunit. 

[0166] Three distinct MHC-II deficiency complementation groups are characterized by a lack of binding of the RFX
complex (refs 2, 13). Since two of these groups (C and D) result from defects in the genes encoding the RFX5 and RFXAP
subunits of RFX (refs 9, 10), it seemed likely that the third group (B) could be accounted for by mutations in a third subunit.
The existence of a third subunit is supported by the finding that RFX5 and RFXAP are not sufficient to generate an X
box specific DNA binding complex (refs 9, 10).

[0167] We had observed earlier that the RFX complex binds to the MHC-II promoters in a cooperative
manner (refs 18, 19, 22) together with two other transcription factors, NF-Y and X2BP (refs 16, 17). These three factors
individually bind DNA with a low affinity. For example, the half-life of an RFX–DNA complex is only 2 to 4 minutes
(refs 18, 19). However, RFX, X2BP and NF-Y combine to form an extremely stable higher order multiprotein complex on MHC-II
promoters (refs 18, 19, 22), extending the half-life of the complex to more than 4 hours (ref. 23). This particular feature was
exploited to develop an efficient single-step DNA-affinity purification procedure.
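
Why the difference in half-life matters for purification can be made concrete with a small calculation. The sketch assumes simple first-order dissociation with no rebinding, so that the fraction of complexes still bound after a time t is 0.5 raised to the power t divided by the half-life. The 60 minute total washing time is a hypothetical figure used only for illustration; the half-lives are the ones quoted above.

```python
def fraction_still_bound(elapsed_min: float, half_life_min: float) -> float:
    """Fraction of complexes remaining on the DNA after elapsed_min,
    assuming first-order dissociation and no rebinding."""
    if half_life_min <= 0:
        raise ValueError("half-life must be positive")
    if elapsed_min < 0:
        raise ValueError("elapsed time cannot be negative")
    return 0.5 ** (elapsed_min / half_life_min)


if __name__ == "__main__":
    WASH_MIN = 60.0  # hypothetical total duration of the washes

    # RFX alone on DNA: half-life of 2 to 4 minutes.
    print(fraction_still_bound(WASH_MIN, 4.0))    # about 0.00003

    # Higher order RFX/X2BP/NF-Y complex: half-life of 4 hours or more.
    print(fraction_still_bound(WASH_MIN, 240.0))  # about 0.84
```

Under these assumptions the individually bound factors are almost entirely washed away, while most of the higher order complex is retained.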

[0168] All steps were done at 4°C. A large scale nuclear extract (1.9 g of protein) was prepared as described (ref. 44) from
2 × 10^11 TK6 B cells. 130 mg of extract (5 mg/ml) were supplemented with MgCl2 to 6 mM, KCl to 100 mM, NP-40 to
0.01%, poly(dIdC)·poly(dIdC) to 0.125 mg/ml, and single-stranded E. coli DNA to 0.125 mg/ml. The extract was
cleared by centrifugation and incubated with 1 mg of streptavidin magnetic beads (Dynal) coated with 30 mg of a
WXX2Y DRA promoter fragment (nucleotides -150 to -47) biotinylated at the 5' end of the upper strand. Binding was
allowed to proceed for 6 hours with end-over-end rotation. The beads were then washed 6 times with 40 ml buffer D (20
mM HEPES pH 7.9, 100 mM KCl, 6 mM MgCl2, 1 mM DTT, 20% glycerol, 0.01% NP-40). Non-specific competitor DNA
(salmon sperm DNA and single-stranded E. coli DNA, both at 0.125 mg/ml) was included during the last four washes.
In addition, the last two washes contained 0.5 mg/ml specific competitor DNA in the form of HinfI-digested plasmid pDR300
(ref. 45). Following the washes with DNA competitors, the beads were washed a further three times with buffer D
containing increased salt concentration (0.2 M KCl). The proteins that remained bound to the biotinylated DRA promoter
fragment were then eluted with buffer D containing 0.6 M KCl and 10 mM EDTA instead of MgCl2.

[0169] To summarize, a biotinylated DRA promoter fragment containing the binding sites for RFX, X2BP, and NF-Y
was coupled to streptavidin-coated paramagnetic beads and incubated with a crude B cell nuclear extract to allow
formation of the multiprotein complex. The beads were then washed extensively with a buffer containing non-specific
competitor DNA. Finally, the degree of purification was further increased by washing the beads with a buffer containing
specific competitor DNA, namely the same DRA promoter sequence that was used for selection. The rationale for
including the latter step in the purification procedure was that it should permit elimination of proteins that bind the DRA
promoter less stably than the RFX–X2BP–NF-Y higher order multiprotein complex. For example, other members of
the X box binding protein family (ref. 24), such as RFX1-RFX4, would be eliminated preferentially by the final washing step
because they bind to the DRA promoter with significantly less stability, and thus with a considerably shorter half-life,
than the multiprotein RFX–X2BP–NF-Y complex.

[0170] This single-step affinity purification procedure turned out to be very efficient. As judged by western blotting
experiments with anti-RFX5 antibodies and Bradford assays to quantitate total protein concentrations, the yield of RFX
was greater than 80% and the enrichment with respect to the crude nuclear extract was at least 3000 fold. Analysis of
the purified fraction by SDS-PAGE indicates that it contains only 11 different protein bands (lane 1, Fig. 1).
Immunoprecipitation of the RFX complex from the affinity purified fraction was a critical step in the identification of the different
protein components of this complex. Rabbit polyclonal antisera for RFX5 and RFXAP have been described previously (ref. 10).
Immunoprecipitation was done according to standard protocols (ref. 46). Analysis of the co-immunoprecipitated proteins was
performed both by one-dimensional (lane 3, Fig. 1) and two-dimensional gel electrophoresis. Two bands were
recognized by Western blot as RFX5 and RFXAP. Two additional protein bands were found at 120 kDa and 33 kDa
respectively. The two proteins were isolated from a preparative gel and subjected to microsequencing by capillary liquid
chromatography tandem mass spectroscopy (LC-MS/MS, ref. 25).
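
The yield and enrichment figures in paragraph [0170] are linked by simple bookkeeping, sketched below. The definitions used here are the conventional ones and are assumptions as far as this text is concerned: yield is the fraction of the target recovered, and enrichment is the ratio of target per unit of total protein after and before purification. The function names are hypothetical. Taking the 130 mg of extract used in Example 1 together with the boundary values quoted (80 % yield, 3000 fold) gives an idea of how little total protein is left in the purified fraction.

```python
def recovery_yield(target_in_crude: float, target_in_purified: float) -> float:
    """Fraction of the target protein recovered after purification."""
    return target_in_purified / target_in_crude


def enrichment_fold(target_in_crude: float, total_in_crude: float,
                    target_in_purified: float, total_in_purified: float) -> float:
    """Ratio of target-to-total-protein after versus before purification."""
    before = target_in_crude / total_in_crude
    after = target_in_purified / total_in_purified
    return after / before


def implied_purified_total(total_in_crude: float, yield_fraction: float,
                           fold: float) -> float:
    """Total protein left in the purified fraction, obtained by rearranging
    the two definitions above: total_purified = yield * total_crude / fold."""
    return total_in_crude * yield_fraction / fold


if __name__ == "__main__":
    # 130 mg of extract, 80 % yield, 3000 fold enrichment.
    total_mg = implied_purified_total(130.0, 0.80, 3000.0)
    print(f"{total_mg * 1000:.0f} micrograms of total protein")  # about 35
```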

[0171] Purified proteins were dialysed against buffer D, precipitated with acetone, and separated by SDS-PAGE. 
After staining the gel with Coomassie Brilliant Blue, bands were excised and subjected to microsequencing as 
described below.

[0172] The 120 kDa band was identified as poly(ADP-ribose) polymerase (PARP), an abundant chromatin-associated
enzyme (ref. 26). The significance of the co-immunoprecipitation of PARP with the RFX complex is not clear. On the other
hand, the novel 33 kDa protein (Fig. 1) was considered a good candidate for an additional subunit of the RFX complex.
This protein was reproducibly and quantitatively co-immunoprecipitated from the total affinity-purified fraction with
antisera directed against either RFXAP or RFX5. Its physical association with RFX5 and RFXAP is very tight because it is
partially resistant to treatment with 1 M guanidine-HCl. The 33 kDa polypeptide thus behaves as a bona fide subunit of
the RFX complex. Judging from its mobility relative to RFXAP (36 kDa), the 33 kDa protein does not correspond to the band
referred to as p41 by others and believed to be a component of RFX (ref. 27). Indeed, no additional polypeptide of this
size range, besides RFXAP and the novel 33 kDa protein, can be co-immunoprecipitated as part of the RFX complex.

[0173] RFXAP antiserum was immunoaffinity-purified on a column containing a C-terminal peptide of RFXAP
(ref. 10) coupled to CNBr-activated Sepharose (Pharmacia-Biotech). Affinity purification of RFXAP antibodies was done
according to standard protocols (ref. 46).

[0174] In the DNA-affinity purified preparation (see lane 1, Fig. 1), three proteins were identified by Western blotting
and immunoprecipitation as subunits of the NF-Y complex. Direct microsequencing by LC-MS/MS confirmed the
presence of NF-Y peptide sequences. The remaining protein bands, distinct from both RFX and NF-Y, probably represent
X2BP and/or contaminants.

Example 2 : Isolation of the RFXANK gene. 

[0175] Approximately 500 ng of the protein present in the 33 kDa band was purified by SDS-PAGE and subjected 
to sequence analysis by LC-MS/MS. 

[0176] RFXANK was sequenced by capillary liquid chromatography tandem mass spectrometry (LC-MS/MS). The
protein excised from the gel was digested with trypsin (Promega, Madison, WI) and extracted as described
previously (ref. 25). Prior to LC-MS/MS, the peptides were vacuum dried and resuspended in H2O. Peptides were loaded onto
a 75 µm RP-HPLC column, packed with 200 Å pore, 5 µm particles of Magic C18 packing material (Michrom
Bioresources, Auburn, CA), at 1000 psi using a pressure bomb (ref. 47). Subsequent elution was performed at 250 nL/min after
fractionation through a splitting Tee (Valco Instruments, Houston, TX) of a linear gradient that was developed for 30 min at
50 µL/min from buffer A (2% CH3CN, 98% H2O, 0.4% CH3COOH, 0.005% C4HF7O2) to buffer B (80% CH3CN,
20% H2O, 0.4% CH3COOH, 0.005% C4HF7O2) on a Michrom Ultrafast Microprotein Analyzer (Michrom Bioresources,
Auburn, CA). Tandem mass spectrometry was conducted on a Finnigan MAT TSQ 7000 (San Jose, CA) equipped with
an in-house built microspray device for peptide ionization. The instrument was run in automated mode, where parent
masses were automatically selected for fragmentation (ref. 48). Collision-induced dissociation (CID) spectra were correlated
with database entries using the SEQUEST programme (ref. 49) and verified by manual interpretation. The databases used for CID
correlation were OWL (http://www.biochem.ucl...) and dbEST (http://www.ncbi.nlm.n...).
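The flow and gradient figures given for the LC step can be checked with a short calculation. The Python sketch below is illustrative only: it assumes a flow of 50 µL/min ahead of the splitting tee and 250 nL/min through the column, and it assumes that the linear gradient runs from 100% buffer A to 100% buffer B over the 30 min. The function names are hypothetical and are not taken from any instrument software.

```python
# Illustrative arithmetic for the capillary LC set-up described above.
# Assumptions (not stated explicitly in the text): the gradient runs from
# 100% buffer A to 100% buffer B, with 50 uL/min before the splitting tee
# and 250 nL/min delivered to the column.

PRE_SPLIT_NL_MIN = 50_000   # 50 uL/min expressed in nL/min
COLUMN_NL_MIN = 250         # flow reaching the capillary column
GRADIENT_MIN = 30.0         # duration of the linear gradient
ACN_A, ACN_B = 2.0, 80.0    # % CH3CN in buffer A and in buffer B


def split_ratio() -> float:
    """Ratio of pump flow to the flow that reaches the column."""
    return PRE_SPLIT_NL_MIN / COLUMN_NL_MIN


def acetonitrile_percent(t_min: float) -> float:
    """% CH3CN delivered at time t_min, clamped to the gradient window."""
    frac_b = min(max(t_min / GRADIENT_MIN, 0.0), 1.0)
    return ACN_A + frac_b * (ACN_B - ACN_A)


if __name__ == "__main__":
    print(f"split ratio 1:{split_ratio():.0f}")
    for t in (0, 15, 30):
        print(f"t = {t:2d} min -> {acetonitrile_percent(t):.1f}% CH3CN")
```

Under these assumptions only a small fraction of the pump flow passes through the column, which is what allows a conventional pump to drive a nanolitre-scale separation.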

[0177] Perfect matches to three independent peptides (Fig. 2a) were identified in a variety of ESTs as well as in the 
theoretical protein product deduced from a gene identified by genomic DNA sequencing (GenBank accession number 
2627294). The complete sequence of the corresponding mRNA (Fig. 2a) was determined by assembling the ESTs into 
a single contig and by comparing it to the genomic sequence. 
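The matching of sequenced peptides to a candidate protein can be illustrated with an in silico tryptic digest. The following Python sketch is a simplification and not the SEQUEST procedure used here: it applies the common rule of cleavage after Lys or Arg unless the next residue is Pro, and it uses residues 65 to 112 of the human RFXANK sequence as transcribed from the sequence listing. The "observed" peptides in the usage example are hypothetical, since the peptides of Fig. 2a are not reproduced in the text.

```python
import re

# Residues 65-112 of human RFXANK, transcribed from the sequence listing.
RFXANK_FRAGMENT = "SSLKHSTTLTNRQRGNEVSALPATLDSLSIHQLAAQGELDQLKEHLRK"


def trypsin_digest(protein: str) -> list[str]:
    """Cleave after K or R, except when the next residue is P."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", protein) if p]


def match_peptides(observed: list[str], protein: str) -> dict[str, bool]:
    """Report which observed peptides are predicted tryptic peptides."""
    predicted = set(trypsin_digest(protein))
    return {pep: pep in predicted for pep in observed}


if __name__ == "__main__":
    print(trypsin_digest(RFXANK_FRAGMENT))
    # Hypothetical observations: one predicted peptide, one unrelated one.
    print(match_peptides(["HSTTLTNR", "AAAAK"], RFXANK_FRAGMENT))
```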

[0178] Perfect matches to three RFXANK peptides were found in the putative protein product (GenBank accession
number 2627294) encoded by a human gene identified by sequencing of two overlapping cosmids (GenBank accession
numbers AD000812 and AC003110) derived from the short arm of chromosome 19, and in a large number of human
EST clones (over 80 different ESTs; accession numbers of representative ESTs: AA282432, AA290933, AA411028,
H63462, AA496321). The complete human RFXANK mRNA sequence was obtained by organizing the ESTs into a
single contig.

[0179] The sequence was further confirmed experimentally by amplifying the entire open reading frame of the
corresponding cDNA by RT-PCR and by sequencing several independent clones.

[0180] The resulting sequence was confirmed by comparison with the genomic sequence and by RT-PCR
amplification and sequencing of RFXANK cDNA clones from control B cell lines (Raji and QBL). The following primers were
used to amplify RFXANK cDNAs by PCR: 5'p33 (5'-CCGTACGCGTCTAGACCATGGAGCTTACCCAGCCTGCAGA-3'),
which overlaps the translation initiation codon, and 3'p33 (5'-TTCGAATTCTCGAGTGTCTGAGTCCCCGGCA-3'),
which is complementary to the 3' untranslated region of RFXANK mRNA. Homology to RFXANK mRNA is underlined.
The primers contain restriction sites at their 5' ends to facilitate cloning. RFXANK cDNAs were cloned into the
expression plasmid EBO-76PL (ref. 8) and pBluescript KS (Stratagene). Twelve RFXANK cDNA clones were sequenced on both
strands. The nucleotide and amino acid sequences of human RFXANK were tested for homology to sequences in
EMBL, GenBank, SwissProt, and dbEST. Sequence analysis was performed with PC/gene (Intelligenetics), BLAST
programs available through the NCBI server (http://www.ncbi.nlm.nih.gov), and a variety of proteomics tools
(http://www.expasy.ch/www/tools.html). For multiple protein sequence alignments, CLUSTALW
(http://www2.ebi.ac.uk/clustalw) was used. ESTs were assembled into contigs with the TIGR Assembler
(http://www.tigr.org). The search for homology to human RFXANK identified EST clones corresponding to mouse
(AA435121, AA616119, AA259432, AA146531) and rat (AA851701) orthologs, and to a highly homologous gene
present in both man (AA496038, AA442702, AA205305, N25678, N70046, AA418029, AA633452, H39858, R86213,
AA418089, N64316, R63682, N55216) and mouse (AA245178, Z31339, AA118335). The sequences of mouse Rfxank
and of the human and mouse homologues were determined by organizing the corresponding ESTs into contigs. The
mouse Rfxank sequence was confirmed by amplifying the cDNA by RT-PCR from C57BL6 mouse spleen RNA using the
following primers:

m5'p33 (5'-CCGTACGCGTCTAGACCATGGAGCCCACTCAGGTTGC-3'), which overlaps the translation initiation
codon,

and m3'p33 (5'-TTCGAATTCTCGAGTGCCTGGGTTCCAGCAGG-3'), which is complementary to the 3' untranslated
region of Rfxank mRNA. Homology to mouse Rfxank mRNA is underlined. The primers included 5' extensions with
restriction sites that were used to clone the mouse Rfxank cDNA directly into the EBO-76PL expression vector (ref. 8).
Fourteen clones were sequenced on both strands.
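The statement that the 3'p33 primer is complementary to the 3' untranslated region can be checked against the sequence listing. The Python sketch below is a minimal illustration, assuming the primer sequence of SEQ ID NO: 2 and the first 43 nucleotides from the stop codon of SEQ ID NO: 10 as transcribed here. It reports the longest 3'-terminal stretch of the primer whose reverse complement occurs in the template. The helper names are hypothetical, and the reported length can exceed the intended homology if bases of the 5' extension happen to match the template as well.

```python
# Primer 3'p33 (SEQ ID NO: 2) and the start of the RFXANK 3' UTR,
# beginning at the TGA stop codon of SEQ ID NO: 10 (transcribed here).
PRIMER_3P33 = "TTCGAATTCTCGAGTGTCTGAGTCCCCGGCA"
UTR_START = "tgaaggccgcctgccggggactcagacactcagggaacaaaat"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def annealing_length(primer: str, template: str) -> int:
    """Length of the longest 3' primer suffix that anneals to the template.

    The primer anneals to the strand complementary to `template`, so the
    reverse complement of the suffix must occur in `template` itself.
    """
    primer, template = primer.upper(), template.upper()
    for n in range(len(primer), 0, -1):
        if reverse_complement(primer[-n:]) in template:
            return n
    return 0


if __name__ == "__main__":
    n = annealing_length(PRIMER_3P33, UTR_START)
    print(f"{n} nt at the 3' end of the primer match the 3' UTR")
    print("annealing part:", PRIMER_3P33[-n:])
```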

[0181] Two splice variants were identified at approximately equal frequencies. They differ only by the insertion of a
single CAG triplet (Fig. 2a) and probably result from the alternative usage of two possible splice acceptor sites situated
3 nucleotides apart upstream of exon 4. An additional minor splice variant lacking exon 5 (see Fig. 2a) was also
identified, both in an EST and in one of the cDNA clones (data not shown).

[0182] The cDNA corresponding to the 33 kDa protein contains a 260 amino acid open reading frame. The trans- 
lation initiation codon is preceded by an in-frame TGA stop codon, indicating that the coding region is complete. The 
deduced molecular weight (28.1 kDa) and isoelectric point (4.45) correspond well to the biochemical parameters deter- 
mined for p33 in one- and two-dimensional gel electrophoresis (data not shown). The protein encoded by the open 
reading frame is novel. In particular, it exhibits no homology to either RFXAP or RFX5, the two other known subunits of 
the RFX complex, nor to other members of the RFX family of DNA binding proteins 24 . A search for homology to known 
proteins and motifs did identify the presence of three ankyrin repeats (Fig. 2b). Together with the fact that it is an essen- 
tial subunit of the RFX complex, this led us to call the protein RFXANK. Outside of the ankyrin repeat region, the only 
other recognizable feature is an N-terminal acidic region resembling transcription activation domains.

[0183] EST clones corresponding to mouse and rat Rfxank were also identified in the database. Mouse ESTs were
organized into a contig to generate a partial mouse sequence, which was then confirmed and completed by isolating
mouse Rfxank cDNA clones by RT-PCR. Homology to human RFXANK is high (85% overall amino acid identity),
particularly within and surrounding the ankyrin repeat region (94% amino acid identity, Fig. 2b). Two different splice
variants were found among mouse Rfxank cDNA clones. The major one, which is shown as the deduced amino acid
sequence aligned with the human sequence in Fig. 2b, is characterized by an additional stretch of 10 amino acids that
precedes the first ankyrin repeat. A minor splice variant lacking these additional 10 amino acids was also represented
among the mouse cDNA clones isolated (not shown). RFXANK may belong to a family of related proteins because we
identified a number of additional EST clones corresponding to at least one human and one mouse gene exhibiting a
high degree of homology to the RFXANK gene (Fig. 2b). These are by far the most closely related sequences currently
present in the database. In addition, the ankyrin repeats of RFXANK show distinct but more limited homology (25-40%
identity) to ankyrin repeat regions of a variety of other proteins (refs. 20, 21), including the β subunit of the transcription factor
GABP (refs. 28, 29; see Fig. 2b).
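Percent identity figures such as those quoted above are obtained by counting identical positions in an alignment. The Python sketch below illustrates the calculation on a short, ungapped example only: the last 20 residues of the human and mouse proteins as transcribed from the sequence listing. It does not reproduce the CLUSTALW alignment of Fig. 2b, and the value for this C-terminal tail is not expected to equal the overall figure.

```python
# C-terminal 20 residues, transcribed from the sequence listing
# (human residues 241-260, mouse residues 250-269).
HUMAN_CTERM = "VIENHILKLFQSNLVPADPE"
MOUSE_CTERM = "VMESHILRLFQSTLGPVDPE"


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


if __name__ == "__main__":
    print(f"{percent_identity(HUMAN_CTERM, MOUSE_CTERM):.0f}% identity")
```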

Example 3 : RFXANK restores MHC-II expression in cells from MHC-II deficiency patients in complementation
group B

[0184] Cell lines and culture. The in vitro generated MHC-II negative B cell line RJ2.2.5 (ref. 39), the EBV-transformed
B cell lines from patients Ab, Na, BLS1, Da and SJO (refs. 14, 40, 43), and the control B cell lines Raji and QBL
were grown in RPMI 1640 medium (GIBCO BRL) supplemented with 10% fetal calf serum, penicillin, streptomycin and
glutamine. Cells were incubated at 37 °C in 5% CO2. The TK6 B cell line used for large scale nuclear extract
preparation was grown at 37 °C in rollers in CO2-independent medium (GIBCO BRL) supplemented with 10% horse serum
(GIBCO BRL), penicillin, streptomycin and glutamine.

[0185] It seemed likely that mutations in the newly identified gene RFXANK could account for the lack of RFX
binding in the last non-elucidated group of MHC-II deficiency. An RFXANK expression vector (pEBO-RFXANK) was
therefore transfected into BLS1, a cell line from a patient in complementation group B. As a control, pEBO-RFXANK was also
transfected into cell lines from complementation groups A (RJ2.2.5), C (SJO) and D (Da), which carry mutations in
CIITA, RFX5 and RFXAP, respectively (refs. 8-10, 15).

[0186] The RFXANK cDNA (the splice variant with the additional CAG) cloned in pEBO-76PL (pEBO-RFXANK), or
the empty pEBO-76PL expression vector, was transfected by electroporation into RJ2.2.5, SJO, BLS1 and Da cells.
Transfected cells were selected with hygromycin as described (ref. 8) and were
maintained under hygromycin selection for at least 10 days prior to FACScan analysis.

[0187] BLS1 transfectants were sorted for HLA-DR expression before FACScan analysis of HLA-DR, HLA-DP,
HLA-DQ and MHC-I expression. Sorting was performed using an HLA-DR specific antibody and magnetic beads
(Dynal) as described previously (ref. 8). FACS analysis was performed as described (ref. 10).

[0188] Expression of HLA-DR was restored in the BLS1 cell line, while no reactivation of DR expression was
observed in the other three cell lines (Fig. 3a). Complementation of BLS1 with pEBO-RFXANK restored expression of
all three MHC-II isotypes (HLA-DR, -DP and -DQ) to levels similar to or greater than those observed in the control B cell
line Raji (Fig. 3b), indicating that all MHC-II α and β chain genes were coordinately reactivated by RFXANK.
Re-expression of MHC-II by transfection with pEBO-RFXANK was also observed in Na and Ab, two other cell lines from patients
in complementation group B (data not shown). In contrast to the drastic effect of RFXANK on MHC-II, no effect on cell
surface expression of MHC class I (MHC-I) genes was observed in the complemented cells (data not shown).

Example 4 : RFXANK is mutated in patients from complementation group B 

[0189] The specific complementation obtained in cell lines from patients in group B suggested that RFXANK is
affected in these patients. The entire coding region of RFXANK was therefore amplified by RT-PCR from the BLS1 , Na 
and Ab cell lines, subcloned and sequenced. 

[0190] The entire coding region of RFXANK mRNA was amplified by RT-PCR from patient cells using the 5'p33 and
3'p33 primers described above. PCR products were subcloned into pBluescript and sequenced on both strands. For
each patient, 3 independent cDNA clones were sequenced. The genomic DNA spanning exons 4 to 7 was amplified by
PCR from patient cells using an exon 4 specific primer (5'-CCAGCTCTAGACTCCACCACTCTCACCAAC-3') having a
5' extension containing an XbaI site (underlined) and an exon 7 specific primer
(5'-CCTTCGAATTCTCGCTCTTTTGCCAGGATG-3') having a 5' extension containing an EcoRI site. PCR products were
subcloned into pBluescript KS (Stratagene) and 6 independent subclones were sequenced for each patient. Analysis of
the wild type and deleted alleles in the patients and their families was done by PCR using intronic primers flanking exon 6:
(5'-GGTTCTCTAGATTGGCAGCACTGGGGATAG-3') and (5'-GCTACGAATTCCAGCAGACACAGCCAAAAC-3'). These primers carry 5'
extensions containing, respectively, XbaI and EcoRI sites (underlined). The sizes of the wild type and deleted PCR
products are, respectively, 265 bp, 239 bp (Ab and Na) and 207 bp (BLS1).
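The XbaI and EcoRI sites carried by the 5' extensions of these primers can be located with a simple string search. The Python sketch below uses the four primer sequences as given in the sequence listing (SEQ ID NO: 5 to 8) and the standard recognition sequences TCTAGA (XbaI) and GAATTC (EcoRI). The dictionary keys are labels chosen for this illustration.

```python
# Recognition sequences of the two enzymes named in the text.
SITES = {"XbaI": "TCTAGA", "EcoRI": "GAATTC"}

# Primers as listed in SEQ ID NO: 5-8; the keys are illustrative labels.
PRIMERS = {
    "exon4": "CCAGCTCTAGACTCCACCACTCTCACCAAC",
    "exon7": "CCTTCGAATTCTCGCTCTTTTGCCAGGATG",
    "intron_fwd": "GGTTCTCTAGATTGGCAGCACTGGGGATAG",
    "intron_rev": "GCTACGAATTCCAGCAGACACAGCCAAAAC",
}


def find_sites(primer: str) -> dict[str, int]:
    """Map each enzyme whose site occurs in the primer to its 0-based offset."""
    primer = primer.upper()
    return {name: primer.find(site) for name, site in SITES.items() if site in primer}


if __name__ == "__main__":
    for label, seq in PRIMERS.items():
        print(label, find_sites(seq))
```

In each primer the site starts after a short run of additional 5' bases rather than at the very end of the oligonucleotide.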

[0191] Analysis of several independent cDNA clones revealed the presence in all three patients of the same
aberrant form of RFXANK mRNA lacking exons 5 and 6 (Fig. 4a). Splicing of exon 4 to exon 7 leads to a frame shift followed
by an out-of-frame stop codon (Fig. 4a) and thus results in the synthesis of a severely truncated RFXANK protein lacking
the entire ankyrin repeat region (see Fig. 2b).
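The effect of a shifted reading frame can be illustrated by translating a short coding sequence until the first stop codon. The Python sketch below uses the standard genetic code and two inputs transcribed from the sequence listing: the first nine codons of the wild type open reading frame (SEQ ID NO: 10) and the 111-nucleotide coding sequence listed as SEQ ID NO: 14. Treating SEQ ID NO: 14 as the region around the aberrant exon junction is an assumption made here for illustration, and the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}

WILD_TYPE_START = "atg gag ctt acc cag cct gca gaa gac"
SEQ_ID_14 = (
    "acc cta gac tgg tgc cga ccc cca cat cct ggc aaa aga gcg aga gag "
    "cgc cct gtc gct ggc cag cac agg cgg cta cac aga cat tgt ggg gct "
    "gct gct gga gcg tga"
)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(WILD_TYPE_START))
    short = translate(SEQ_ID_14)
    print(short, f"({len(short)} residues before the stop codon)")
```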

[0192] To define the mutations leading to the aberrantly spliced RFXANK mRNA, we amplified by PCR the region
spanning exons 4-7 from genomic DNA isolated from patients BLS1, Na and Ab, and from control individuals. The PCR
products were subcloned and 6 independent subclones were sequenced for each patient. All clones from patients Na and
Ab contained a 26 bp deletion that removes the splice acceptor site and the first nucleotide of exon 6; all clones from
patient BLS1 contained a 58 bp deletion that removes the last 23 nucleotides and the splice donor site of exon 6 (Fig.
4b).

[0193] To determine whether the patients are homozygous for the mutated alleles, the region spanning exon 6 was
amplified by PCR from genomic DNA. Only the allele containing the 26 bp deletion was detected in Na and Ab, and only
the allele containing the 58 bp deletion was detected in BLS1 (Fig. 4b). In the case of Na, this is consistent with the fact
that the parents are consanguineous (ref. 7). For Ab and BLS1, homozygosity was established by analyzing DNA from the
other family members: both parents of Ab carry the allele containing the 26 bp deletion and both parents of BLS1 carry
the allele containing the 58 bp deletion (Fig. 4b).
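The genotyping logic follows directly from the product sizes: a 265 bp wild type product, reduced by 26 bp or 58 bp for the two deletions. The Python sketch below is an illustration of that arithmetic only. The band-matching tolerance and the function names are assumptions introduced here, and a single detected band is reported as homozygous without modelling allele drop-out.

```python
WILD_TYPE_BP = 265
DELETIONS_BP = {"del26": 26, "del58": 58}


def expected_sizes() -> dict[str, int]:
    """Expected PCR product size for each allele."""
    sizes = {"wt": WILD_TYPE_BP}
    sizes.update({name: WILD_TYPE_BP - d for name, d in DELETIONS_BP.items()})
    return sizes


def genotype(bands_bp: list[int], tolerance: int = 3) -> str:
    """Call a genotype from observed band sizes (tolerance is an assumption)."""
    expected = expected_sizes()
    alleles = sorted(
        {name for band in bands_bp for name, size in expected.items()
         if abs(band - size) <= tolerance}
    )
    if not alleles:
        return "no call"
    if len(alleles) == 1:
        return f"homozygous {alleles[0]}"
    return "heterozygous " + "/".join(alleles)


if __name__ == "__main__":
    print(expected_sizes())
    print("patient-like sample:", genotype([239]))
    print("carrier-like sample:", genotype([265, 207]))
```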

Example 5 : RFXANK is essential for binding of the RFX complex to DNA.

[0194] Cells having mutations in RFXANK (group B), RFX5 (group C) and RFXAP (group D) all lack detectable RFX
binding activity, suggesting that each of the three subunits is essential for binding. The availability of cDNA clones
encoding all three subunits of the RFX complex allowed us to test this directly. The three subunits were translated in
vitro, either singly or together in all possible combinations, and the translation products were tested for their ability to
bind to an X box probe in an electrophoretic mobility shift assay (EMSA).

[0195] In vitro transcription-translation reactions and electrophoretic mobility shift assays (EMSA) using nuclear
extracts and in vitro translated proteins were done as described (refs. 9, 22, 50). The production of polyclonal rabbit antisera
specific for RFX5 and RFXAP and their use in supershift experiments have also been described (ref. 10). The monoclonal
anti-FLAG antibody (M2, Kodak) was used in supershift experiments at a final concentration of 20 ng/ml. The RFXANK
cDNA tagged with a FLAG epitope at its N terminus was constructed as follows. The entire RFXANK open reading
frame was amplified from the pEBO-RFXANK plasmid by PCR with primers 3'p33 (described above)
and FLAG-5'p33 (5'-CCGTACGCGTCTAGAATGGATTACAAAGACGATGACGATAAGATGGAGCTTACCCAGCCTGCAGAAGAC-3').
The FLAG epitope (DYKDDDDK) coding sequence is underlined. The PCR product containing the FLAG
sequence fused to the 5' end of RFXANK was cloned in pBluescript KS (Stratagene).

[0196] The results demonstrate that X box-binding complexes are indeed generated, but exclusively when all three
subunits are present (Fig. 5a). The RFX-DNA complexes that are generated with recombinant subunits migrate as
three discrete bands, of which the lower co-migrates with the complex formed by native RFX in nuclear extracts (Fig.
5a). The reason for the aberrant mobility is not clear. Reasonable explanations include an abnormal conformation due
to defective folding, abnormal posttranslational modifications, or an incorrect stoichiometry between the different
subunits. Supershift experiments with antibodies directed against each of the three subunits demonstrated that the different
complexes all contain RFX5, RFXAP and RFXANK (Fig. 5b). The anti-RFX5 and anti-RFXAP antibodies used in these
experiments are known to supershift the RFX complex efficiently and specifically (ref. 10). In the case of RFXANK, the
complexes were generated with a FLAG epitope-tagged version of RFXANK, and an anti-FLAG antibody was used for the
supershift. The specificity of the supershift obtained with the anti-FLAG antibody is demonstrated by the fact that
migration of the complexes is not affected when they contain RFXANK lacking the FLAG tag.
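The design of the in vitro translation experiment, with the three subunits tested singly and in all possible combinations, can be enumerated programmatically. The Python sketch below lists the seven non-empty combinations and encodes the reported outcome, namely that X box binding is seen only when all three subunits are present. The function names are illustrative.

```python
from itertools import combinations

SUBUNITS = ("RFX5", "RFXAP", "RFXANK")


def all_combinations() -> list[tuple[str, ...]]:
    """Every non-empty combination of the three RFX subunits."""
    return [c for r in range(1, len(SUBUNITS) + 1)
            for c in combinations(SUBUNITS, r)]


def binds_x_box(combo: tuple[str, ...]) -> bool:
    """Reported EMSA outcome: binding requires all three subunits."""
    return set(combo) == set(SUBUNITS)


if __name__ == "__main__":
    for combo in all_combinations():
        outcome = "complex" if binds_x_box(combo) else "no complex"
        print(" + ".join(combo), "->", outcome)
```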

Example 6 : new single step DNA purification procedure 

[0197] We had observed earlier that the RFX complex binds to the MHC-II promoters in a cooperative
manner (refs. 18, 19, 22) together with two other transcription factors, NF-Y and X2BP (refs. 16, 17). These three factors individually
bind DNA with a low affinity. For example, the half-life of an RFX-DNA complex is only 2 to 4 minutes (refs. 18, 19). However,
RFX, X2BP, and NF-Y combine to form an extremely stable higher order multiprotein complex on MHC-II
promoters (refs. 18, 19, 22), extending the half-life of the complex to more than 4 hours (ref. 23). This particular feature was
exploited to develop an efficient single-step DNA-affinity purification procedure.
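The benefit of the difference in half-life can be made concrete with a first-order decay estimate. The Python sketch below assumes simple exponential dissociation with no rebinding during a wash containing excess competitor DNA, and it uses a hypothetical 30 min wash duration chosen only for illustration. The half-lives are those quoted above: up to 4 minutes for RFX alone and 4 hours, taken as a lower bound, for the higher order complex.

```python
def fraction_bound(minutes: float, half_life_min: float) -> float:
    """Fraction of complexes still bound after `minutes`, first-order decay."""
    return 0.5 ** (minutes / half_life_min)


if __name__ == "__main__":
    wash_min = 30.0  # hypothetical wash duration, not taken from the text
    for label, t_half in (("RFX alone (4 min)", 4.0),
                          ("RFX-X2BP-NF-Y (240 min)", 240.0)):
        print(f"{label}: {fraction_bound(wash_min, t_half):.3f} remains bound")
```

Under these assumptions less than 1% of the short-lived complexes would survive such a wash, while more than 90% of the stable multiprotein complexes would remain on the beads.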

[0198] All steps were done at 4°C. A large scale nuclear extract (1.9 g of protein) was prepared as described (ref. 44) from
2 x 10^11 TK6 B cells. 130 mg of extract (5 mg/ml) were supplemented with MgCl2 to 6 mM, KCl to 100 mM, NP-40 to
0.01%, poly(dI-dC)·poly(dI-dC) to 0.125 mg/ml, and single-stranded E. coli DNA to 0.125 mg/ml. The extract was
cleared by centrifugation and incubated with 1 mg of streptavidin magnetic beads (Dynal) coated with 30 mg of a
WXX2Y DRA promoter fragment (nucleotides -150 to -47) biotinylated at the 5' end of the upper strand. Binding was
allowed to proceed for 6 hours with end-over-end rotation. The beads were then washed 6 times with 40 ml buffer D (20
mM HEPES pH 7.9, 100 mM KCl, 6 mM MgCl2, 1 mM DTT, 20% glycerol, 0.01% NP-40). Non-specific competitor DNA
(salmon sperm DNA and single-stranded E. coli DNA, both at 0.125 mg/ml) was included during the last four washes.
In addition, the last two washes contained 0.5 mg/ml specific competitor DNA in the form of Hinf I-digested plasmid pDR300
(ref. 45). Following the washes with DNA competitors, the beads were washed a further three times with buffer D
containing an increased salt concentration (0.2 M KCl). The proteins that remained bound to the biotinylated DRA promoter
fragment were then eluted with buffer D containing 0.6 M KCl and 10 mM EDTA instead of MgCl2.
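The wash sequence of this procedure can be written out as a small data structure, which makes the overlap between the competitor washes explicit. The Python sketch below encodes six washes in buffer D (100 mM KCl), with non-specific competitor in the last four and specific competitor in the last two, followed by three washes at 0.2 M KCl. That the high-salt washes contain no competitor DNA is an assumption drawn from the wording of the paragraph, and the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Wash:
    kcl_molar: float
    nonspecific_competitor: bool
    specific_competitor: bool


def build_schedule() -> list[Wash]:
    """Wash steps between binding and elution, in order."""
    washes = [
        Wash(0.1, nonspecific_competitor=i >= 2, specific_competitor=i >= 4)
        for i in range(6)
    ]
    # Assumption: the three high-salt washes are done without competitor DNA.
    washes += [Wash(0.2, False, False) for _ in range(3)]
    return washes


if __name__ == "__main__":
    for n, wash in enumerate(build_schedule(), start=1):
        print(n, wash)
```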

[0199] To summarize, a biotinylated DRA promoter fragment containing the binding sites for RFX, X2BP, and NF-Y
was coupled to streptavidin-coated paramagnetic beads and incubated with a crude B cell nuclear extract to allow
formation of the multiprotein complex. The beads were then washed extensively with a buffer containing non-specific
competitor DNA. Finally, the degree of purification was further increased by washing the beads with a buffer containing
specific competitor DNA, namely the same DRA promoter sequence that was used for selection. The rationale for
including the latter step in the purification procedure was that it should permit elimination of proteins that bind the DRA
promoter less stably than the RFX-X2BP-NF-Y higher order multiprotein complex. For example, other members of
the X box binding protein family (ref. 24), such as RFX1-RFX4, would be eliminated preferentially by the final washing step
because they bind to the DRA promoter with significantly less stability, and thus with a considerably shorter half-life,
than the multiprotein RFX-X2BP-NF-Y complex.

[0200] This single-step affinity purification procedure turned out to be very efficient. As judged by Western blotting
experiments with anti-RFX5 antibodies and Bradford assays to quantitate total protein concentrations, the yield of RFX
was greater than 80% and the enrichment with respect to the crude nuclear extract was at least 3000-fold. Analysis of
the purified fraction by SDS-PAGE indicates that it contains only 11 different protein bands (lane 1, Fig. 1).
Immunoprecipitation of the RFX complex from the affinity-purified fraction was a critical step in the identification of the
different protein components of this complex. Rabbit polyclonal antisera for RFX5 and RFXAP have been described previously (ref. 10).
Immunoprecipitation was done according to standard protocols (ref. 46). Analysis of the co-immunoprecipitated proteins was
performed by both one-dimensional (lane 3, Fig. 1) and two-dimensional gel electrophoresis. Two bands were
recognized by Western blot as RFX5 and RFXAP. Two additional protein bands were found at 120 kDa and 33 kDa,
respectively. The two proteins were isolated from a preparative gel.
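The yield and enrichment figures can be related to the amount of total protein expected in the purified fraction. The Python sketch below is a rough estimate only: it defines fold enrichment as the ratio of RFX per unit of total protein after and before purification, and it plugs in the 130 mg of extract used in the procedure together with the boundary values of 80% yield and 3000-fold enrichment. Since the text gives these as lower limits, the result is indicative rather than a measured quantity.

```python
def protein_recovered_mg(input_mg: float, yield_fraction: float,
                         fold_enrichment: float) -> float:
    """Total protein in the purified fraction.

    With enrichment = (target/total)_purified / (target/total)_crude and
    yield = target_purified / target_crude, it follows that
    total_purified = total_crude * yield / enrichment.
    """
    return input_mg * yield_fraction / fold_enrichment


if __name__ == "__main__":
    mg = protein_recovered_mg(130.0, 0.80, 3000.0)
    print(f"about {mg * 1000:.0f} micrograms of total protein")
```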

SEQUENCE LISTING
<110> MEOlGEN SA c/o LENZ AND STAEHELIN

<120> NEW TRANSCRIPTION FACTOR OF MHC CLASS II GENES, 

SUBSTANCES CAPABLE OF INHIBITING THIS NEW TRANSCRIPTION 
FACTOR AND MEDICAL USES OF THESE SUBSTANCES 

<130> B3991GD/LL 

<140> 98120085.0 
<141> 1998-10-23 

<160> 19 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 1 

ccgtacgcgt ctagaccatg gagcttaccc agcctgcaga 

<210> 2 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

ttcgaattct cgagtgtctg agtccccggc a 

<210> 3 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 3

ccgtacgcgt ctagaccatg gagcccactc aggttgc 



<210> 4 
<211> 32 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 4 

ttcgaattct cgagtgcctg ggttccagca gg 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

ccagctctag actccaccac tctcaccaac 



<210> 6 
<211> 30 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

ccttcgaatt ctcgctcttt tgccaggatg 



<210> 7 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 7

ggttctctag attggcagca ctggggatag 30 

<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 8

gctacgaatt ccagcagaca cagccaaaac 30

<210> 9 
<211> 69 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 9 

ccgtacgcgt ctagaatgga ttacaaagac gatgacgata agatggagct tacccagcct 60 
gcagaagac 69 



<210> 10 

<211> 1345 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (418)..(1200)

<400> 10

acgcagggaa ggaggcacac ccgggggtgg cgcagtgagg agggggcgcg acggccagga 60
ggctggtgga gcgacaccca ggcaggagag ggggaagaac tctctccctt tctgaacccc 120
cttttccttg agagacgagt tgggggagtc ctccacgcat tacccactcg ggccgcaaaa 180
actcccttct ttagccctct gcccccgccc ttgcttataa gcctttgaga ccgcagaagg 240
gaccttgttg tggaacggga cggccaagag gaagccagat cgctgagggt ccggtctcca 300
gtttgcctcc tgctatatcc attggaagag aaaagtttgt gacttgggcc cccaagtttt 360
gagagaactg ggctttcggc gcggggggac agaggaggct cgtggggagc tttcccc 417

atg gag ctt acc cag cct gca gaa gac ctc atc cag acc cag cag acc 465
Met Glu Leu Thr Gln Pro Ala Glu Asp Leu Ile Gln Thr Gln Gln Thr
1 5 10 15

cct gcc tca gaa ctt ggg gac cct gaa gac ccc gga gag gag gct gca 513
Pro Ala Ser Glu Leu Gly Asp Pro Glu Asp Pro Gly Glu Glu Ala Ala
20 25 30

gat ggc tca gac act gtg gtc ctc agt ctc ttt ccc tgc acc cct gag 561
Asp Gly Ser Asp Thr Val Val Leu Ser Leu Phe Pro Cys Thr Pro Glu
35 40 45

cct gtg aat cct gaa ccg gat gcc agt gtt tcc tct cca cag gca ggc 609
Pro Val Asn Pro Glu Pro Asp Ala Ser Val Ser Ser Pro Gln Ala Gly
50 55 60

agc tcc ctg aag cac tcc acc act ctc acc aac cgg cag cga ggg aac 657
Ser Ser Leu Lys His Ser Thr Thr Leu Thr Asn Arg Gln Arg Gly Asn
65 70 75 80

gag gtg tca gct ctg ccg gcc acc cta gac tcc ctg tcc atc cac cag 705
Glu Val Ser Ala Leu Pro Ala Thr Leu Asp Ser Leu Ser Ile His Gln
85 90 95

ctc gca gca cag ggg gag ctg gac cag ctg aag gag cat ttg cgg aaa 753
Leu Ala Ala Gln Gly Glu Leu Asp Gln Leu Lys Glu His Leu Arg Lys
100 105 110

ggt gac aac ctc gtc aac aag cca gac gag cgc ggc ttc acc ccc ctc 801
Gly Asp Asn Leu Val Asn Lys Pro Asp Glu Arg Gly Phe Thr Pro Leu
115 120 125

atc tgg gcc tcc gcc ttt gga gag att gag acc gtt cgc ttc ctg ctg 849
Ile Trp Ala Ser Ala Phe Gly Glu Ile Glu Thr Val Arg Phe Leu Leu
130 135 140

gag tgg ggt gcc gac ccc cac atc ctg gca aaa gag cga gag agc gcc 897
Glu Trp Gly Ala Asp Pro His Ile Leu Ala Lys Glu Arg Glu Ser Ala
145 150 155 160

ctg tcg ctg gcc agc aca ggc ggc tac aca gac att gtg ggg ctg ctg 945
Leu Ser Leu Ala Ser Thr Gly Gly Tyr Thr Asp Ile Val Gly Leu Leu
165 170 175

ctg gag cgt gac gtg gac atc aac atc tat gat tgg aat gga ggg acg 993
Leu Glu Arg Asp Val Asp Ile Asn Ile Tyr Asp Trp Asn Gly Gly Thr
180 185 190

cca ctg ctg tac gct gtg cgc ggg aac cac gtg aaa tgc gtt gag gcc 1041
Pro Leu Leu Tyr Ala Val Arg Gly Asn His Val Lys Cys Val Glu Ala
195 200 205

ttg ctg gcc cga ggc gct gac ctc acc acc gaa gcc gac tct ggc tac 1089
Leu Leu Ala Arg Gly Ala Asp Leu Thr Thr Glu Ala Asp Ser Gly Tyr
210 215 220

acc ccg atg gac ctt gcc gtg gcc ctg gga tac cgg aaa gtg caa cag 1137
Thr Pro Met Asp Leu Ala Val Ala Leu Gly Tyr Arg Lys Val Gln Gln
225 230 235 240

gtg atc gag aac cac atc ctc aag ctc ttc cag agc aac ctg gtg ccc 1185
Val Ile Glu Asn His Ile Leu Lys Leu Phe Gln Ser Asn Leu Val Pro
245 250 255

gct gac cct gag tga aggccgcctg ccggggactc agacactcag ggaacaaaat 1240
Ala Asp Pro Glu
260

ggtcagccag agctggggaa acccagaact gacttcaaag gcagcttctg gacaggtggt 1300 
gggaggggac ccttcccaag aggaaccaat aaaccttctg tgcag 1345 



<210> 11
<211> 260
<212> PRT

<213> Homo sapiens
<400> 11

Met Glu Leu Thr Gln Pro Ala Glu Asp Leu Ile Gln Thr Gln Glu Thr
1 5 10 15

Pro Ala Ser Glu Leu Gly Asp Pro Glu Asp Pro Gly Glu Glu Ala Ala
20 25 30

Asp Gly Ser Asp Thr Val Val Leu Ser Leu Phe Pro Cys Thr Pro Glu
35 40 45

Pro Val Asn Pro Glu Pro Asp Ala Ser Val Ser Ser Pro Gln Ala Gly
50 55 60

Ser Ser Leu Lys His Ser Thr Thr Leu Thr Asn Arg Gln Arg Gly Asn
65 70 75 80

Glu Val Ser Ala Leu Pro Ala Thr Leu Asp Ser Leu Ser Ile His Gln
85 90 95

Leu Ala Ala Gln Gly Glu Leu Asp Gln Leu Lys Glu His Leu Arg Lys
100 105 110

Gly Asp Asn Leu Val Asn Lys Pro Asp Glu Arg Gly Phe Thr Pro Leu
115 120 125

Ile Trp Ala Ser Ala Phe Gly Glu Ile Glu Thr Val Arg Phe Leu Leu
130 135 140

Glu Trp Gly Ala Asp Pro His Ile Leu Ala Lys Glu Arg Glu Ser Ala
145 150 155 160

Leu Ser Leu Ala Ser Thr Gly Gly Tyr Thr Asp Ile Val Gly Leu Leu
165 170 175

Leu Glu Arg Asp Val Asp Ile Asn Ile Tyr Asp Trp Asn Gly Gly Thr
180 185 190

Pro Leu Leu Tyr Ala Val Arg Gly Asn His Val Lys Cys Val Glu Ala
195 200 205

Leu Leu Ala Arg Gly Ala Asp Leu Thr Thr Glu Ala Asp Ser Gly Tyr
210 215 220

Thr Pro Met Asp Leu Ala Val Ala Leu Gly Tyr Arg Lys Val Gln Gln
225 230 235 240

Val Ile Glu Asn His Ile Leu Lys Leu Phe Gln Ser Asn Leu Val Pro
245 250 255

Ala Asp Pro Glu
260

<210> 12
<211> 260
<212> PRT

<213> Homo sapiens
<400> 12

Met Glu Leu Thr Gln Pro Ala Glu Asp Leu Ile Gln Thr Gln Gln Thr
1 5 10 15

Pro Ala Ser Glu Leu Gly Asp Pro Glu Asp Pro Gly Glu Glu Ala Ala
20 25 30

Asp Gly Ser Asp Thr Val Val Leu Ser Leu Phe Pro Cys Thr Pro Glu
35 40 45

Pro Val Asn Pro Glu Pro Asp Ala Ser Val Ser Ser Pro Gln Ala Gly
50 55 60

Ser Ser Leu Lys His Ser Thr Thr Leu Thr Asn Arg Gln Arg Gly Asn
65 70 75 80

Glu Val Ser Ala Leu Pro Ala Thr Leu Asp Ser Leu Ser Ile His Gln
85 90 95

Leu Ala Ala Gln Gly Glu Leu Asp Gln Leu Lys Glu His Leu Arg Lys
100 105 110

Gly Asp Asn Leu Val Asn Lys Pro Asp Glu Arg Gly Phe Thr Pro Leu
115 120 125

Ile Trp Ala Ser Ala Phe Gly Glu Ile Glu Thr Val Arg Phe Leu Leu
130 135 140

Glu Trp Gly Ala Asp Pro His Ile Leu Ala Lys Glu Arg Glu Ser Ala
145 150 155 160

Leu Ser Leu Ala Ser Thr Gly Gly Tyr Thr Asp Ile Val Gly Leu Leu
165 170 175

Leu Glu Arg Asp Val Asp Ile Asn Ile Tyr Asp Trp Asn Gly Gly Thr
180 185 190

Pro Leu Leu Tyr Ala Val Arg Gly Asn His Val Lys Cys Val Glu Ala
195 200 205

Leu Leu Ala Arg Gly Ala Asp Leu Thr Thr Glu Ala Asp Ser Gly Tyr
210 215 220

Thr Pro Met Asp Leu Ala Val Ala Leu Gly Tyr Arg Lys Val Gln Gln
225 230 235 240

Val Ile Glu Asn His Ile Leu Lys Leu Phe Gln Ser Asn Leu Val Pro
245 250 255

Ala Asp Pro Glu
260



<210> 13 
<211> 269 
<212> PRT 

<213> Murinae gen. sp-. 
<400> 13 

Met Glu Pro Thr Gin Val Ala Glu Asn Leu Val Pro Asn Gin Gin Pro 

Pro Val Pro Asp Leu Glu Asp Pro Glu Asp Thr Arg Asp Glu Ser Pro 
20 25 30 

Glu Asn Ser Asp Thr Val Val Leu Ser Leu Phe Pro Cys Thr Pro Asp 
35 40 45 

Ala Val Asn Pro Glu Ala Asp Ala Ser Ala Ser Ser Leu Gln Gly Ser 
50 55 60 

Phe Leu Lys His Ser Thr Thr Leu Thr Asn Arg Gln Arg Gly Asn Glu 
65 70 75 80 

Val Ser Ala Leu Pro Ala Thr Leu Asp Ser Leu Ser Ile His Gln Leu 
85 90 95 

Ala Ala Gln Gly Glu Leu Ser Gln Leu Lys Asp His Leu Arg Lys Gly 
100 105 110 

Ala Cys Pro Ala Cys Thr Cys Leu Ser Gly Asn Asn Leu Ile Asn Lys 
115 120 125 

Pro Asp Glu Arg Gly Phe Thr Pro Leu Ile Trp Ala Ser Ala Phe Gly 
130 135 140 

Glu Ile Glu Thr Val Arg Phe Leu Leu Asp Trp Gly Ala Asp Pro His 
145 150 155 160 

Ile Leu Ala Lys Glu Arg Glu Ser Ala Leu Ser Leu Ala Ser Met Gly 
165 170 175 

Gly Tyr Thr Asp Ile Val Arg Leu Leu Leu Asp Arg Asp Val Asp Ile 
180 185 190 

Asn Ile Tyr Asp Trp Asn Gly Gly Thr Pro Leu Leu Tyr Ala Val Arg 
195 200 205 

Gly Asn His Val Lys Cys Val Glu Ala Leu Leu Ala Arg Gly Ala Asp 
210 215 220 

Leu Thr Thr Glu Ala Asp Ser Gly Tyr Thr Pro Met Asp Leu Ala Val 
225 230 235 240 

Ala Leu Gly Tyr Arg Lys Val Gln Gln Val Met Glu Ser His Ile Leu 
245 250 255 

Arg Leu Phe Gln Ser Thr Leu Gly Pro Val Asp Pro Glu 
260 265 



<210> 14 

<211> 111 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(111) 

<400> 14 

acc cta gac tgg tgc cga ccc cca cat cct ggc aaa aga gcg aga gag 48 
Thr Leu Asp Trp Cys Arg Pro Pro His Pro Gly Lys Arg Ala Arg Glu 
1 5 10 15 

cgc cct gtc gct ggc cag cac agg cgg cta cac aga cat tgt ggg gct 96 
Arg Pro Val Ala Gly Gln His Arg Arg Leu His Arg His Cys Gly Ala 
20 25 30 

gct gct gga gcg tga 111 
Ala Ala Gly Ala 
35 



<210> 15 
<211> 37 
<212> PRT 

<213> Homo sapiens 




<400> 15 

Thr Leu Asp Trp Cys Arg Pro Pro His Pro Gly Lys Arg Ala Arg Glu 
1 5 10 15 

Arg Pro Val Ala Gly Gln His Arg Arg Leu His Arg His Cys Gly Ala 
20 25 30 

Ala Ala Gly Ala 
35 



<210> 16 

<211> 42 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 
<222> (31)..(42) 

<400> 16 

ctggtggtat tgcccgcctc ctcctgccag gtg aca acc tcg 42 



<210> 17 
<211> 74 

<212> DNA 
<213> Homo sapiens 

<220> 

<221> exon 

<222> (1)..(27) 

<400> 17 

gag acc gtt cgc ttc ctg ctg gag tgg gtgcgtccca gcccagctgg 47 

gcagctgggg ggttcccggg ggcctta 74 



<210> 18 
<211> 220 
<212> PRT 

<213> Homo sapiens 



<400> 18 

Asn Ala Phe Asn Val Phe Thr Phe Val Phe His Leu Ala Glu Cys Asn 
1 5 10 15 

Ile His Thr Ser Pro Ser Pro Gly Ile Gln Val Arg His Val Xaa Thr 
20 25 30 

Pro Ser Thr Thr Lys His Phe Ser Pro Ile Lys Gln Ser Thr Thr Leu 
35 40 45 

Thr Asn Lys His Arg Gly Asn Glu Val Ser Thr Thr Pro Leu Leu Ala 
50 55 60 

Asn Ser Leu Ser Val His Gln Leu Ala Ala Gln Gly Glu Met Leu Tyr 
65 70 75 80 

Leu Ala Thr Arg Ile Glu Gln Glu Asn Val Ile Asn His Thr Asp Glu 
85 90 95 

Glu Gly Phe Thr Pro Leu Met Trp Ala Ala Ala His Gly Gln Ile Ala 
100 105 110 

Val Val Glu Phe Leu Leu Gln Asn Gly Ala Asp Pro Gln Leu Leu Gly 
115 120 125 

Lys Gly Arg Glu Ser Ala Leu Ser Leu Ala Cys Ser Lys Gly Tyr Thr 
130 135 140 

Asp Xle Val Xaa Met Leu Leu Asp Cys Gly Val Asp Val Asn Xaa Tyr 
145 150 155 160 

Asp Trp Asn Gly Gly Thr Pro Leu Leu Tyr Ala Val His Gly Asn His 
165 170 175 

Val Lys Cys Val Lys Met Leu Leu Glu Ser Gly Ala Asp Pro Thr Ile 
180 185 190 

Glu Thr Asp Ser Gly Tyr Asn Ser Met Asp Leu Ala Val Ala Leu Gly 
195 200 205 

Ile Glu Val Phe Asn Arg Leu Leu Ser His Ile Cys 
210 215 220 



<210> 19 
<211> 218 
<212> PRT 

<213> Murinae gen. sp. 
<400> 19 

Ala Ser Val Leu Phe Lys Ala Glu Cys Asn Ile His Thr Ser Pro Ser 
1 5 10 15 

Pro Gly Ile Gln Val Arg His Val Tyr Thr Pro Ser Thr Thr Lys His 
20 25 30 

Phe Ser Pro Ile Lys Gln Ser Thr Thr Leu Thr Asn Lys His Arg Gly 
35 40 45 

Asn Glu Val Ser Thr Thr Pro Leu Leu Ala Asn Ser Leu Ser Ala His 
50 55 60 

Gln Leu Ala Ala Gln Gly Glu Met Leu Tyr Leu Ala Thr Arg Ile Glu 
65 70 75 80 

Gln Glu Asn Val Ile Asn His Thr Asp Glu Glu Gly Phe Thr Pro Leu 
85 90 95 

Met Trp Ala Ala Ala His Gly Gln Ile Ala Val Val Glu Phe Leu Leu 
100 105 110 

Gln Asn Gly Ala Asp Pro Gln Leu Leu Gly Lys Gly Arg Glu Ser Ala 
115 120 125 

Leu Ser Leu Ala Cys Ser Lys Gly Tyr Thr Asp Ile Val Lys Met Leu 
130 135 140 

Leu Asp Cys Gly Val Asp Val Asn Glu Tyr Asp Trp Asn Gly Gly Thr 
145 150 155 160 

Pro Leu Leu Tyr Ala Gly His Gly Asn His Val Lys Cys Val Lys Met 
165 170 175 

Leu Leu Glu Asn Gly Ala Asp Pro Thr Ile Glu Thr Asp Ser Gly Tyr 
180 185 190 

Asn Ser Met Asp Leu Ala Val Ala Leu Gly Ile Glu Gly Cys Ser Asp 
195 200 205 

Tyr Met Leu Val Thr Asp Val Phe Arg Ile 
210 215 



Claims 


1. Protein or peptide capable of restoring the MHC-II expression in cells from MHC-II deficiency patients in complementation group B and comprising all or part of the amino-acid sequence shown in figure 3. 

2. Protein or peptide according to claim 1 wherein the cells are BLS1 cell line or Na cell line or Ba cell line. 

3. Protein or peptide according to claim 1 or 2 wherein the MHC-II is HLA-DR or HLA-DP or HLA-DQ. 

4. Protein or peptide comprising the amino acid sequence shown in figure 2 or a part thereof, or an amino acid 
sequence having at least 80 % and preferably at least 90 % homology with the illustrated sequences or said part 
thereof. 

5. Protein or peptide which is the homologous protein of a protein or a peptide of any one of claims 1 to 4 in another 
species than human. 

6. Protein or peptide of claim 5 wherein the species is pig. 

7. Antibodies capable of specifically recognising a peptide or protein according to any one of claims 1 to 6. 

8. Antibodies according to claim 7 which are monoclonal. 

9. Antibodies according to claim 7 or 8 which are single chain antibodies. 

10. Antibodies according to any one of claims 7 to 9 which are capable of inhibiting a function or an activity of a protein 
or a peptide of any one of claims 1 to 6. 

11. Nucleic acid molecule encoding a protein or a peptide according to any one of claims 1 to 6 or a chain of antibodies 
according to any one of claims 7 to 10. 

12. Nucleic acid molecule according to claim 11 comprising all or part of the nucleotide sequence illustrated in figure 2. 

13. Nucleic acid molecule comprising a sequence complementary to the nucleic acid molecules of any one of claims 
11 to 12. 

14. Nucleic acid molecule capable of hybridizing in stringent conditions, with the nucleic acid molecules of any one of 
claims 11 to 13. 

15. Nucleic acid molecule comprising at least one of the sequences illustrated in figures 3, or a sequence exhibiting at 
least 90 % homology with any of these sequences, or a fragment of any of these sequences. 

16. Nucleic acid molecule of any one of claims 11 to 15 comprising all or part of the DNA molecule encoding the RFX-ANK gene of a species other than human. 

17. Nucleic acid molecule of claim 16 wherein the species is pig. 

18. Nucleic acid molecule comprising a sequence complementary to the nucleic acid molecule of any one of claims 11, 
12 or 14 to 17. 

19. Anti-sense molecule or ribozyme comprising a nucleic acid molecule of claim 13 or 18. 

20. Vector comprising a nucleic acid molecule of any one of claims 11 to 19. 

21. Process for identifying inhibitors which have the capacity to inhibit a function or an activity of a protein or a peptide 
according to any one of claims 1 to 6 or of a nucleic acid molecule according to any one of claims 11 to 18 comprising 
detecting or measuring of said function or activity after intervention of the potential inhibitor. 

22. Process according to claim 21 wherein said function or activity is the expression of MHC class II molecules. 

23. Process according to claim 22 wherein the expression of MHC class II molecules is measured at the surface of 
cells. 

24. Process according to claim 22 wherein the expression of MHC class II is measured at the mRNA level or in the cells. 


25. Process according to claim 23 or 24 wherein said cells are B lymphocyte cell lines with constitutive expression of 
MHC class II or interferon gamma inducible cell lines. 

26. Process according to claim 21 wherein said function or activity is the formation of RFX complex. 

27. Process according to claim 21 wherein said function or activity is the binding of the RFX complex to its DNA target. 

28. Process according to claim 27 wherein the measure or detection of the function or activity is done by gel retardation 
assay. 

29. Process according to claim 21, wherein said function or activity is the interaction between the RFX complex and at 
least one of transcription factors X2BP, NF-Y and CIITA. 

30. Process according to claim 21 wherein said function or activity is the correction of the MHC II expression defect of 
cell lines from complementation group B. 

31. Process for identifying inhibitors which have the capacity to inhibit the synthesis of a protein or a peptide according 
to any one of claims 1 to 6 or of a nucleic acid molecule according to any one of claims 11 to 18 comprising 
detection or measuring a product which contributes to the synthesis of said protein or peptide after intervention of the 
potential inhibitor. 

32. Process according to claim 31 wherein said product is mRNA. 

33. Process according to any one of claims 21 to 32 comprising a preliminary screening of said potential inhibitor which 
consists in screening for the binding of molecules to a peptide or a protein of any one of claims 1 to 6 or nucleic 
acid molecule of any one of claims 11 to 18. 

34. Process according to claim 33 wherein the binding of molecules is detected by ligand-induced change in protein 
conformation. 

35. Process according to claim 33 wherein the binding of molecules is detected by ligand-induced displacement of 
molecules first identified as binding to a peptide or a protein of any of claims 1 to 6. 

36. Process for identifying inhibitors which have the capacity to inhibit a function, an activity or the synthesis of a protein 
or a peptide according to any one of claims 1 to 6 or of a nucleic acid molecule according to any one of claims 11 
to 18 comprising the designing of said inhibitors on the basis of the three dimensional structure of a protein or a 
peptide according to any one of claims 1 to 6. 

37. Process according to claim 36 wherein the three dimensional structure is obtained using X-Ray structure analysis 
or spectroscopic methods. 

38. Inhibitors of a protein or a peptide according to any one of claims 1 to 6, or of a nucleic acid molecule according to 
any one of claims 11 to 18. 

39. Inhibitor according to claim 38 identifiable by a process according to any one of claims 21 to 38. 

40. Inhibitor according to claim 39 which is an antibody according to any one of claims 7 to 8, a nucleic acid molecule 
according to claim 13 or 18, a derivative of a protein or a peptide according to any one of claims 1 to 6 or of a nucleic 
acid molecule according to any one of claims 11 to 18, or an anti-sense molecule or a ribozyme according to claim 
19. 

41. Inhibitor according to claim 39 which is an antibody, a single chain antibody, a dominant negative mutant, a protein, 
a peptide, a small molecular weight molecule, a ribozyme or an anti-sense molecule. 

42. Nucleic acid molecule encoding an inhibitor of any one of claims 38 to 41. 

43. Inhibitor according to any one of claims 38 to 41 for use in therapy. 

44. Pharmaceutical composition comprising an inhibitor according to any one of 
tion with a pharmaceutically acceptable vehicle. 


45. Use of an inhibitor according to any one of claims 38 to 41 for the preparation of a medicament for use in therapy 
or prevention of diseases associated with aberrant expression of MHC class II genes. 

46. Use of an inhibitor according to any one of claims 38 to 41 as an immunosuppressive agent. 
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Figure 1 
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